GENES AND POLYMORPHISMS ASSOCIATED WITH
CARDIOVASCULAR DISEASE AND THEIR USE 

RELATED APPLICATIONS 

Benefit of priority is claimed to U.S. application Serial No.
09/802,640, entitled "GENES AND POLYMORPHISMS ASSOCIATED
WITH CARDIOVASCULAR DISEASE AND THEIR USE", filed on March 9,
2001 by Andreas Braun, Aruna Bansal, and Patrick W. Kleyn. Where
permitted, the subject matter of this application is incorporated by
reference in its entirety.

FIELD OF THE INVENTION

The field of the invention involves genes and polymorphisms of 
these genes that are associated with development of cardiovascular 
disease. Methods that use polymorphic markers for prognosticating, 
profiling drug response and drug discovery are provided. 

BACKGROUND OF THE INVENTION

Diseases in all organisms have a genetic component, whether
inherited or resulting from the body's response to environmental stresses,
such as viruses and toxins. The ultimate goal of ongoing genomic
research is to use this information to develop new ways to identify, treat
and potentially cure these diseases. The first step has been to screen
disease tissue and identify genomic changes at the level of individual
samples. The identification of these "disease" markers has then fueled
the development and commercialization of diagnostic tests that detect
these errant genes or polymorphisms. With the increasing numbers of
genetic markers, including single nucleotide polymorphisms (SNPs),
microsatellites, tandem repeats, and newly mapped introns and exons, the
challenge to the medical and pharmaceutical communities is to identify
genotypes that not only identify the disease but also follow the
progression of the disease and are predictive of an organism's response
to treatment.

Polymorphisms 

Polymorphisms have been known since 1901 with the identification
of blood types. In the 1950s they were identified on the level of proteins
using large population genetic studies. In the 1980s and 1990s many
of the known protein polymorphisms were correlated with genetic loci on
genomic DNA. For example, the gene dose of the apolipoprotein E type 4
allele was correlated with the risk of Alzheimer's disease in late onset
families (see, e.g., Corder et al. (1993) Science 261:921-923); a mutation
in blood coagulation factor V was associated with resistance to activated
protein C (see, e.g., Bertina et al. (1994) Nature 369:64-67); resistance
to HIV-1 infection has been shown in Caucasian individuals bearing
mutant alleles of the CCR-5 chemokine receptor gene (see,
e.g., Samson et al. (1996) Nature 382:722-725); and a hypermutable
tract in the adenomatous polyposis coli (APC) gene has been
identified in familial colorectal cancer in individuals of Ashkenazi Jewish
background (see, e.g., Laken et al. (1997) Nature Genet. 17:79-83).
There may be more than three million polymorphic sites in the human
genome. Many have been identified, but not yet characterized or mapped
or associated with a disease. Polymorphisms of the genome can lead to
altered gene function, protein function or mRNA instability. To identify
those polymorphisms that have clinical relevance is the goal of a
worldwide scientific effort. Discovery of such polymorphisms will have a
fundamental impact on the identification and development of diagnostics
and drug discovery.

Single nucleotide polymorphisms (SNPs) 

Much of the focus of genomics has been on the identification of
SNPs, which are important for a variety of reasons. They allow indirect
testing (association of haplotypes) and direct testing (functional variants).
They are the most abundant and stable genetic markers. Common
diseases are best explained by common genetic alterations, and the
natural variation in the human population aids in understanding disease,
therapy and environmental interactions.

The organization of SNPs in the primary sequence of a gene into
one of the limited number of combinations that exist as units of
inheritance is termed a haplotype. Each haplotype therefore contains
significantly more information than individual unorganized polymorphisms
and provides an accurate measurement of the genomic variation in the
two chromosomes of an individual. While it is well-established that many
diseases are associated with specific variation in gene sequences and
there are examples in which individual polymorphisms act as genetic
markers for a particular phenotype, in other cases an individual
polymorphism may be found in a variety of genomic backgrounds and
therefore shows no definitive coupling between the polymorphism and the
phenotype. In these instances, the observed haplotype and its frequency
of occurrence in various genotypes will provide a better genetic marker
for the phenotype.
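
As an illustration of the notion of haplotype frequency discussed above,
the following minimal sketch tallies how often each combination of SNP
alleles occurs in a set of chromosomes. It assumes phased data (the alleles
on each chromosome are already known); the function name and the example
haplotype strings are hypothetical and are not taken from this application.

```python
from collections import Counter


def haplotype_frequencies(haplotypes):
    """Return the relative frequency of each observed haplotype.

    `haplotypes` is a list of strings. Each string holds the alleles found
    at an ordered set of SNP positions on one chromosome (phased data are
    assumed for this sketch).
    """
    counts = Counter(haplotypes)
    total = sum(counts.values())
    return {hap: n / total for hap, n in counts.items()}


# Hypothetical example: three SNPs typed on eight chromosomes.
observed = ["TAG", "TAG", "CAG", "TAG", "CGA", "CAG", "TAG", "CGA"]
freqs = haplotype_frequencies(observed)
```

Comparing such frequencies between groups that differ in phenotype is one
simple way to see whether a haplotype, rather than a single polymorphism,
tracks with the phenotype.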

Although risk factors for the development of cardiovascular disease
are known, such as high serum cholesterol levels and low serum high
density lipoprotein (HDL) levels, the genetic basis for the manifestation of
these phenotypes remains unknown. An understanding of the genes that
are responsible for controlling cholesterol and HDL levels, along with
useful genetic markers and mutations in these genes that affect these
phenotypes, will allow for detection of a predisposition for these risk
factors and/or cardiovascular disease and the development of therapeutics
to modulate such alterations.

Therefore, among the objects herein, it is an object herein to
provide methods and products for detection of a predisposition for these
risk factors and/or cardiovascular disease.

SUMMARY OF THE INVENTION

Provided herein are methods for using polymorphic markers to
detect a predisposition to the manifestation of high serum cholesterol, low
serum HDL and cardiovascular disease. The ultimate goals are
elucidating pathological pathways, developing new diagnostic assays,
determining genetic profiles for positive responses to therapeutic drugs,
identifying new potential drug targets and identifying new drug
candidates.

A database of twins was screened for individuals that exhibit high
or low levels of serum cholesterol or HDL. Using a full genome scanning
approach, SNPs present in DNA samples from these individuals were
examined for alleles that associate with either high levels of cholesterol or
low levels of HDL. This led to the discovery of the association of the
cytochrome C oxidase subunit VIb (COX6B) gene and the
N-acetylglucosaminyl transferase component glycosylphosphatidylinositol-1
(referred to herein as GPI-1) gene with these risk factors for developing
cardiovascular disease. Specifically, a previously undetermined
association of an allelic variant at nucleotide 86 of the COX6B gene and
high serum cholesterol levels has been discovered. In addition, it has
been discovered that an allelic variant at nucleotide 2577 of the GPI-1
gene is associated with low serum HDL levels. There was no previously
known association between these two genes and risk factors related to
cardiovascular disease.
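
The comparison of allele frequencies between groups that underlies such an
association can be sketched as follows. This is a minimal illustration,
assuming a biallelic SNP and diploid genotype counts; the function name and
all counts are hypothetical and are not data from this application, and no
statistical test is included.

```python
def allele_frequency(n_hom_ref, n_het, n_hom_alt):
    """Frequency of the alternate allele from diploid genotype counts.

    Each subject carries two alleles, so heterozygotes contribute one copy
    of the alternate allele and alternate homozygotes contribute two.
    """
    total_alleles = 2 * (n_hom_ref + n_het + n_hom_alt)
    if total_alleles == 0:
        raise ValueError("no genotypes supplied")
    return (n_het + 2 * n_hom_alt) / total_alleles


# Hypothetical genotype counts for a high-cholesterol and a low-cholesterol group.
high_group = allele_frequency(n_hom_ref=50, n_het=40, n_hom_alt=10)
low_group = allele_frequency(n_hom_ref=70, n_het=25, n_hom_alt=5)
difference = high_group - low_group
```

A difference in allele frequency between the two groups is only a starting
point; whether it reflects a genuine association depends on sample size and
an appropriate statistical test.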

Methods are provided for detecting the presence or absence of at
least one allelic variant associated with high cholesterol, low HDL and/or
cardiovascular disease by detecting the presence or absence of at least
one allelic variant of the COX6B gene or the GPI-1 gene, individually or in
combination with one or more allelic variants of other genes associated
with cardiovascular disease.

Also provided are methods for indicating a predisposition to
manifesting high serum cholesterol, low serum HDL and/or cardiovascular
disease based on detecting the presence or absence of at least one allelic
variant of the COX6B or GPI-1 genes, alone or in combination with one or
more allelic variants of other genes associated with cardiovascular
disease. These methods, referred to as haplotyping, are based on
assaying more than one polymorphism of the COX6B and/or GPI-1 genes.
One or more polymorphisms of other genes associated with
cardiovascular disease may also be assayed at the same time. A
collection of allelic variants of one or more genes may be more
informative than a single allelic variant of any one gene. A single
polymorphism of a collection of polymorphisms present in the COX6B
and/or GPI-1 genes and in other genes associated with cardiovascular
disease may be assayed individually or the collection may be assayed
simultaneously using a multiplex assay method.

Also provided are microarrays that include a probe selected from
among an oligonucleotide complementary to a polymorphic region
surrounding position 86 of the sense strand of the COX6B gene coding
sequence; an oligonucleotide complementary to a polymorphic region
surrounding the position of the antisense strand of COX6B corresponding
to position 86 of the sense strand of the COX6B gene coding sequence;
an oligonucleotide complementary to a polymorphic region surrounding
position 2577 of the sense strand of the GPI-1 gene; and an
oligonucleotide complementary to a polymorphic region surrounding the
position of the antisense strand of GPI-1 corresponding to position 2577
of the sense strand of the GPI-1 gene. Microarrays are well known and
can be made, for example, using methods set forth in U.S. Patent Nos.
5,837,832; 5,858,659; 6,043,136; 6,043,031 and 6,156,501.

Further provided are methods of using allelic variants of the COX6B
or GPI-1 gene individually or together with one or more allelic variants of
other genes associated with cardiovascular disease to predict a subject's
response to a biologically active agent that modulates serum cholesterol,
serum HDL, or a cardiovascular drug.

Also provided are methods to screen candidate biologically active
agents for modulation of cholesterol, HDL or other factors associated with
cardiovascular disease. These methods use cells or transgenic animals
containing one or more allelic variants of the COX6B gene and/or the
GPI-1 gene alone or in combination with allelic variants of one or more
other genes associated with cardiovascular disease. Such animals should
exhibit high cholesterol, low HDL or other known phenotypes associated
with cardiovascular disease. Also provided are methods to construct
transgenic animals that are useful as models for cardiovascular disease by
using one or more allelic variants of the COX6B gene and/or the GPI-1
gene alone or in combination with allelic variants of one or more other
genes associated with cardiovascular disease.

Further provided are combinations of probes and primers and kits
for predicting a predisposition to high serum cholesterol, low HDL levels
and/or cardiovascular disease. In particular, combinations and kits
contain probes or primers that are capable of hybridizing adjacent to or at
polymorphic regions of the COX6B and/or GPI-1 gene. The combinations
and kits can also contain probes or primers that are capable of hybridizing
adjacent to or at polymorphic regions of other genes associated with
cardiovascular disease. The kits also optionally contain instructions for
carrying out assays, interpreting results and for aiding in diagnosing a
subject as having a predisposition towards developing high serum
cholesterol, low HDL levels and/or cardiovascular disease. Combinations
and kits are also provided for predicting a subject's response to a
therapeutic agent directed toward modulating cholesterol, HDL, or another
phenotype associated with cardiovascular disease. Such combinations
and kits contain probes or primers as described above.

In particular, for the methods, combinations, kits and arrays
described above, the polymorphisms are SNPs. The detection or
identification is of a T nucleotide at position 86 of the sense strand of the
COX6B gene coding sequence or the detection or identification of an A
nucleotide at the corresponding position in the antisense strand of the
COX6B gene coding sequence. Also embodied is the detection or
identification of an A nucleotide at position 2577 of the sense strand of
the GPI-1 gene or the detection or identification of a T nucleotide at the
corresponding position in the antisense strand of the GPI-1 gene. In
addition to the SNPs discussed above, other polymorphisms of the
COX6B and GPI-1 genes can be assayed for association with high
cholesterol or low HDL, respectively, and used as disclosed above.
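
The correspondence between a sense-strand allele and the nucleotide found
at the same position on the antisense strand follows from Watson-Crick base
pairing, and can be sketched as below. The helper names are hypothetical and
the sketch handles only the four unambiguous DNA bases.

```python
# Watson-Crick pairing for the four unambiguous DNA bases.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def antisense_base(sense_base):
    """Return the base that pairs with `sense_base` on the opposite strand."""
    return COMPLEMENT[sense_base.upper()]


def reverse_complement(seq):
    """Return the opposite strand of `seq`, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


# A T on the sense strand pairs with an A on the antisense strand, and an
# A on the sense strand pairs with a T on the antisense strand.
cox6b_antisense = antisense_base("T")
gpi1_antisense = antisense_base("A")
```

The same mapping is what allows a probe or primer to be designed against
either strand of the polymorphic region.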

Other genes containing allelic variants associated with high serum
cholesterol, low HDL and/or cardiovascular disease include, but are not
limited to: cholesterol ester transfer protein, plasma (CETP);
apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1); apolipoprotein
E (APO E); apolipoprotein B (APO B); apolipoprotein C-III (APO C3); a
gene encoding lipoprotein lipase (LPL); ATP-binding cassette transporter
(ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding hepatic
lipase, E-selectin, G protein beta 3 subunit and angiotensin II type 1
receptor gene.

The detection of the presence or absence of an allelic variant can
use, but is not limited to, methods such as allele specific hybridization,
primer specific extension, oligonucleotide ligation assay, restriction
enzyme site analysis and single-stranded conformation polymorphism
analysis.

In particular, primers used in primer specific extension hybridize
adjacent to nucleotide 86 of the COX6B gene or nucleotide 2577 of the
GPI-1 gene or the corresponding positions on the antisense strand
(numbers refer to GenBank sequences, see pages 15-17). A primer can
be extended in the presence of at least one dideoxynucleotide, particularly
ddG, or two dideoxynucleotides, particularly ddG and ddC. Typically,
detection of extension products is by mass spectrometry. Detection of
allelic variants can also involve signal moieties such as radioisotopes,
enzymes, antigens, antibodies, spectrophotometric reagents,
chemiluminescent reagents, fluorescent reagents and other light
producing reagents.
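
The principle of primer extension in the presence of a dideoxynucleotide
can be illustrated with a simplified sketch: the primer is extended base by
base and stops as soon as a terminating nucleotide is incorporated, so that
different alleles give products of different length (and hence different
mass). The function name and both sequences below are hypothetical; the
sketch does not model reaction chemistry or compute molecular masses.

```python
def extend_primer(primer, bases_to_add, terminators):
    """Simulate extension of `primer` until a terminator is incorporated.

    `bases_to_add` lists, in order, the nucleotides that would be added to
    the 3' end of the primer (the complement of the template downstream of
    the primer). `terminators` is the set of bases supplied as
    dideoxynucleotides; extension stops after the first such base.
    """
    product = primer
    for base in bases_to_add:
        product += base
        if base in terminators:
            break
    return product


primer = "ACGTTGCA"  # hypothetical primer sequence

# Hypothetical allele 1: the first incorporated base is G, so ddG stops
# extension after a single nucleotide.
allele_1 = extend_primer(primer, "GTAC", terminators={"G"})

# Hypothetical allele 2: the first incorporated base is A, so extension
# continues until the next G is reached.
allele_2 = extend_primer(primer, "ATGC", terminators={"G"})
```

In this toy case the two products differ in length by two nucleotides,
which is the kind of difference a mass-based readout can resolve.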

Other probes and primers useful for the detection of allelic variants
include those that hybridize at or adjacent to the SNPs described in Tables
1-3 and specifically those that include SEQ ID NOs.: 5, 10, 43, 48, 53,
58, 63, 68, 73, 78, 83, 88, 93, 98, 103, 108, 113, and 118.

DESCRIPTION OF THE DRAWINGS

Figure 1 depicts the allelic frequency and genotype for pools and
individually determined samples of blood from individuals having low
cholesterol levels and those with high cholesterol levels.

Figure 2 depicts the allelic frequency and genotype for pools and
individually determined samples of blood from individuals having high HDL
levels and those with low HDL levels.

DETAILED DESCRIPTION
A. Definitions 

Unless defined otherwise, all technical and scientific terms used
herein have the same meaning as is commonly understood by one of skill
in the art to which this invention belongs. All patents, patent applications
and publications referred to throughout the disclosure herein are, unless
noted otherwise, incorporated by reference in their entirety. In the event
that there is a plurality of definitions for terms herein, those in this
section prevail.

As used herein, sequencing refers to the process of determining a
nucleotide sequence and can be performed using any method known to
those of skill in the art. For example, if a polymorphism is identified or
known, and it is desired to assess its frequency or presence in nucleic
acid samples taken from the subjects of the database, the region of
interest from the samples can be isolated, such as by PCR or restriction
fragments, hybridization or other suitable method known to those of skill
in the art, and sequenced. For purposes herein, sequencing analysis, for
example, can be effected using mass spectrometry (see, e.g., U.S. Patent
Nos. 5,547,835, 5,622,824, 5,851,765, and 5,928,906). Nucleic acids
also can be sequenced by hybridization (see, e.g., U.S. Patent Nos.
5,503,980, 5,631,134, 5,795,714), including analysis by mass
spectrometry (see, U.S. Application Serial Nos. 08/419,994 and
09/395,409). Alternatively, sequencing may be performed using other
known methods, such as set forth in U.S. Patent Nos. 5,525,464;
5,695,940; 5,834,189; 5,869,242; 5,876,934; 5,908,755; 5,912,118;
5,952,174; 5,976,802; 5,981,186; 5,998,143; 6,004,744; 6,017,702;
6,018,041; 6,025,136; 6,046,005; 6,087,095; 6,117,634; 6,013,431;
WO 98/30883; WO 98/56954; WO 99/09218; WO 00/58519, and
others.

As used herein, "polymorphism" refers to the coexistence of more
than one form of a gene or portion thereof. A portion of a gene of which
there are at least two different forms, i.e., two different nucleotide
sequences, is referred to as a "polymorphic region of a gene". A
polymorphic region can be a single nucleotide, the identity of which
differs in different alleles. A polymorphic region also can be several
nucleotides in length.

As used herein, "polymorphic gene" refers to a gene having at least 
one polymorphic region. 

As used herein, "allele", which is used interchangeably herein with
"allelic variant", refers to alternative forms of a gene or portions thereof.
Alleles occupy the same locus or position on homologous chromosomes.
When a subject has two identical alleles of a gene, the subject is said to
be homozygous for the gene or allele. When a subject has two different
alleles of a gene, the subject is said to be heterozygous for the gene.
Alleles of a specific gene can differ from each other in a single nucleotide,
or several nucleotides, and can include substitutions, deletions, and
insertions of nucleotides. An allele of a gene also can be a form of a gene
containing a mutation.
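
The distinction between homozygous and heterozygous subjects defined above
can be expressed as a one-line check on the two alleles carried at a locus.
The function name and the example alleles are hypothetical.

```python
def zygosity(allele_1, allele_2):
    """Classify a diploid genotype from the two alleles at one locus."""
    return "homozygous" if allele_1 == allele_2 else "heterozygous"


# Hypothetical genotypes at a single SNP with alleles C and T.
subject_a = zygosity("T", "T")
subject_b = zygosity("C", "T")
```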

As used herein, the term "subject" refers to mammals and in
particular human beings.

As used herein, the term "gene" or "recombinant gene" refers to a
nucleic acid molecule comprising an open reading frame and including at
least one exon and (optionally) at least one intron sequence. A gene can
be either RNA or DNA. Genes may include regions preceding and
following the coding region (leader and trailer).

As used herein, "intron" refers to a DNA sequence present in a
given gene that is spliced out during mRNA maturation.

As used herein, the term "coding sequence" refers to that portion
of a gene that encodes an amino acid sequence of a protein.

As used herein, the term "sense strand" refers to that strand of a
double-stranded nucleic acid molecule that encodes the sequence of the
mRNA that encodes the amino acid sequence encoded by the
double-stranded nucleic acid molecule.

As used herein, the term "antisense strand" refers to that strand of
a double-stranded nucleic acid molecule that is the complement of the
sequence of the mRNA that encodes the amino acid sequence encoded
by the double-stranded nucleic acid molecule.

As used herein, a DNA or nucleic acid homolog refers to a nucleic
acid that includes a preselected conserved nucleotide sequence. By the
term "substantially homologous" is meant having at least 80%, preferably
at least 90%, most preferably at least 95% homology therewith or a lesser
percentage of homology or identity and conserved biological activity or
function.

Regarding hybridization, as used herein, stringency conditions to
achieve specific hybridization refer to the washing conditions for
removing the non-specific probes or primers and conditions that are
equivalent to either high, medium, or low stringency as described below:

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65°C

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C

3) low stringency: 1.0 x SSPE, 0.1% SDS, 50°C.

It is understood that equivalent stringencies may be achieved using
alternative buffers, salts and temperatures.

As used herein, "heterologous DNA" is DNA that encodes RNA and
proteins that are not normally produced in vivo by the cell in which it is
expressed or that mediates or encodes mediators that alter expression of
endogenous DNA by affecting transcription, translation, or other
regulatable biochemical processes or is not present in the exact
orientation or position as the counterpart DNA in a wildtype cell.
Heterologous DNA may also be referred to as foreign DNA. Any DNA
that one of skill in the art would recognize or consider as heterologous or
foreign to the cell in which it is expressed is herein encompassed by
heterologous DNA. Examples of heterologous DNA include, but are not
limited to, DNA that encodes traceable marker proteins, such as a protein
that confers drug resistance, DNA that encodes therapeutically effective
substances, such as anti-cancer agents, enzymes and hormones, and
DNA that encodes other types of proteins, such as antibodies. Antibodies
that are encoded by heterologous DNA may be secreted or expressed on
the surface of the cell in which the heterologous DNA has been
introduced.

As used herein, a "promoter region" refers to the portion of DNA of
a gene that controls transcription of the DNA to which it is operatively
linked. The promoter region includes specific sequences of DNA that are
sufficient for RNA polymerase recognition, binding and transcription
initiation. This portion of the promoter region is referred to as the
promoter. In addition, the promoter region includes sequences that
modulate this recognition, binding and transcription initiation activity of
the RNA polymerase. These sequences may be cis acting or may be
responsive to trans acting factors. Promoters, depending upon the nature
of the regulation, may be constitutive or regulated.

As used herein, the phrase "operatively linked" generally means the
sequences or segments have been covalently joined into one piece of
DNA, whether in single or double stranded form, whereby control or
regulatory sequences on one segment control or permit expression or
replication or other such control of other segments. The two segments
are not necessarily contiguous. For gene expression, a DNA sequence and
a regulatory sequence(s) are connected in such a way as to control or permit
gene expression when the appropriate molecules, e.g., transcriptional
activator proteins, are bound to the regulatory sequence(s).

As used herein, the term "vector" refers to a nucleic acid molecule
capable of transporting another nucleic acid to which it has been linked.
One exemplary type of vector is an episome, i.e., a nucleic acid capable
of extra-chromosomal replication. Exemplary vectors include those
capable of autonomous replication and/or expression of nucleic acids to
which they are linked. Vectors capable of directing the expression of
genes to which they are operatively linked are referred to herein as
"expression vectors". In general, expression vectors of utility in
recombinant DNA techniques are often in the form of "plasmids" that
refer generally to circular double stranded DNA loops which, in their
vector form, are not bound to the chromosome. "Plasmid" and "vector"
are used interchangeably as the plasmid is the most commonly used form
of vector. Also included are other forms of expression vectors that serve
equivalent functions and that become known in the art subsequently
hereto.

As used herein, "indicating" or "determining" means that the
presence or absence of an allelic variant may be one of many factors that
are considered when a subject's predisposition to a disease or disorder is
evaluated. Thus a predisposition to a disease or disorder is not
necessarily conclusively determined by only ascertaining the presence or
absence of one or more allelic variants, but the presence of one or more
of such variants is among a number of factors considered.

As used herein, "predisposition to develop a disease or disorder"
means that a subject having a particular genotype and/or haplotype has a
higher likelihood than one not having such a genotype and/or haplotype
for developing a particular disease or disorder.

As used herein, "transgenic animal" refers to any animal, generally
a non-human animal, e.g. a mammal, bird or an amphibian, in which one
or more of the cells of the animal contain heterologous nucleic acid
introduced by way of human intervention, such as by transgenic
techniques well known in the art. The nucleic acid is introduced into the
cell, directly or indirectly by introduction into a precursor of the cell, by
way of deliberate genetic manipulation, such as by microinjection or by
infection with a recombinant virus. The term genetic manipulation does
not include classical cross-breeding, or in vitro fertilization, but rather is
directed to the introduction of a recombinant DNA molecule. This
molecule may be integrated within a chromosome, or it may be
extrachromosomally replicating DNA. In the typical transgenic animals
described herein, the transgene causes cells to express a recombinant
form of a protein. However, transgenic animals in which the recombinant
gene is silent are also contemplated, as for example, using the FLP or CRE
recombinase dependent constructs. Moreover, "transgenic animal" also
includes those recombinant animals in which gene disruption of one or
more genes is caused by human intervention, including both
recombination and antisense techniques.

As used herein, "transgene" describes genetic material that has
been or is about to be artificially inserted into the genome of a mammalian
cell, particularly a mammalian cell of a living animal. The transgene is
used to transform a cell, meaning that a permanent or transient genetic
change, typically a permanent genetic change, is induced in a cell
following incorporation of exogenous DNA. A permanent genetic change
is generally achieved by introduction of the DNA into the genome of the
cell. Vectors for stable integration include, but are not limited to,
plasmids, retroviruses and other animal viruses and YACs. Of interest are
transgenic mammals, including, but not limited to, cows, pigs, goats,
horses and others, and particularly rodents, including rats and mice.

WO 02/072604    PCT/US02/06728

As used herein, "associated" refers to coincidence with the
development or manifestation of a disease, condition or phenotype.
Association may be due to, but is not limited to, genes responsible for
housekeeping functions, those that are part of a pathway that is involved
in a specific disease, condition or phenotype and those that indirectly
contribute to the manifestation of a disease, condition or phenotype.

As used herein, "high serum cholesterol" refers to a level of serum
cholesterol that is greater than that considered to be in the normal range
for a given age in a population, e.g., about 5.25 mmoles/L or greater, i.e.,
approximately one standard deviation or more away from the age-adjusted
mean.

As used herein, "low serum HDL" refers to a level of serum HDL
that is less than that considered to be in the normal range for a given age
in a population, e.g., about 1.11 mmoles/L or less, i.e., approximately one
standard deviation or more away from the age-adjusted mean.
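
The two example cutoffs given in the definitions above can be applied as
simple threshold checks, as in the following sketch. It treats the "about"
values as exact cutoffs and ignores age adjustment, both of which are
simplifying assumptions; the constant and function names are hypothetical.

```python
# Example cutoffs from the definitions above, in mmol/L. The definitions
# say "about" and refer to an age-adjusted mean, so fixed cutoffs are a
# simplification made for this sketch.
HIGH_CHOLESTEROL_MMOL_L = 5.25
LOW_HDL_MMOL_L = 1.11


def has_high_serum_cholesterol(cholesterol_mmol_l):
    """True if serum cholesterol is at or above the example cutoff."""
    return cholesterol_mmol_l >= HIGH_CHOLESTEROL_MMOL_L


def has_low_serum_hdl(hdl_mmol_l):
    """True if serum HDL is at or below the example cutoff."""
    return hdl_mmol_l <= LOW_HDL_MMOL_L


# Hypothetical subject with serum cholesterol 6.0 mmol/L and HDL 1.3 mmol/L.
subject_flags = (has_high_serum_cholesterol(6.0), has_low_serum_hdl(1.3))
```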

As used herein, "cardiovascular disease" refers to any
manifestation of or predisposition to cardiovascular disease including, but
not limited to, coronary artery disease and myocardial infarction. Included
in predisposition is the manifestation of risk factors such as high serum
cholesterol levels and low serum HDL levels.

As used herein, "target nucleic acid" refers to a nucleic acid
molecule that contains all or a portion of a polymorphic region of a gene
of interest.

As used herein, "signal moiety" refers to any moiety that allows for
the detection of a nucleic acid molecule. Included are moieties covalently
attached to nucleic acids and those that are not.

As used herein, "biologically active agent that modulates serum
cholesterol" refers to any drug, including, but not limited to, small
molecule, nucleic acid (sense and antisense), protein, peptide, lipid,
carbohydrate and combinations thereof, that exhibits some effect directly
or indirectly on the cholesterol measured in a subject's serum.

As used herein, "biologically active agent that modulates serum
HDL" refers to any drug, such as, but not limited to, small molecule,
nucleic acid (sense and antisense), protein, peptide, lipid, carbohydrate
and combinations thereof, that exhibits some effect directly or indirectly
on the HDL measured in a subject's serum.

As used herein, "expression and/or activity" refers to the level of 
transcription or translation of the COX6B or GPI-1 gene, mRNA stability, 
protein stability or biological activity. 

As used herein, "cardiovascular drug" refers to a drug used to treat
cardiovascular disease or a risk factor for the disease, either
prophylactically or after a risk factor or disease condition has developed.
Cardiovascular drugs include those drugs used to lower serum cholesterol
and those used to alter the level of serum HDL.

As used herein, "combining" refers to contacting the biologically
active agent with a cell or animal such that the agent is introduced into
the cell or animal. For a cell, any method that results in an agent
traversing the plasma membrane is useful. For an animal, any of the
standard routes of administration of an agent, e.g. oral, rectal,
transmucosal, intestinal, intravenous, intraperitoneal, intraventricular,
subcutaneous, intramuscular and other routes can be used.

As used herein, "positive response" refers to improving or
ameliorating at least one symptom or detectable characteristic of a
disease or condition, e.g., lowering serum cholesterol levels or raising
serum HDL levels.

As used herein, "biological sample" refers to any cell type or tissue
of a subject from which nucleic acid, particularly DNA, can be obtained.

As used herein, "array" refers to a collection of three or more
items, such as a collection of immobilized nucleic acid probes arranged on a
solid substrate, such as silica, polymeric materials, glass and other
suitable support materials known to those of skill in the art.

As used herein, a composition refers to any mixture. It may be a
solution, a suspension, liquid, powder, a paste, aqueous, non-aqueous or
any combination thereof.

As used herein, a combination refers to any association between
two or among more items.

As used herein, "kit" refers to a package that contains a
combination, such as one or more primers or probes used to amplify or
detect polymorphic regions of genes associated with cardiovascular
disease, optionally including instructions and/or reagents for their use.

As used herein, "specifically hybridizes" refers to hybridization of a
probe or primer to a target sequence in preference to a non-target
sequence. Those of skill in the art are familiar with parameters that affect
hybridization, such as temperature, probe or primer length and
composition, buffer composition and salt concentration, and can readily
adjust these parameters to achieve specific hybridization of a nucleic acid
to a target sequence.

As used herein, "nucleic acid" refers to polynucleotides such as
deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). The term should
also be understood to include, as equivalents, derivatives, variants and
analogs of either RNA or DNA made from nucleotide analogs, single
(sense or antisense) and double-stranded polynucleotides.
Deoxyribonucleotides include deoxyadenosine, deoxycytidine,
deoxyguanosine and deoxythymidine. For RNA, the uracil base is uridine.

As used herein, "mass spectrometry" encompasses any suitable
mass spectrometry format known to those of skill in the art. Such
formats include, but are not limited to, Matrix-Assisted Laser
Desorption/Ionization, Time-of-Flight (MALDI-TOF), Electrospray (ES),
IR-MALDI (see, e.g., published International PCT Application No. 99/57318
and U.S. Patent No. 5,118,937), Ion Cyclotron Resonance (ICR), Fourier
Transform and combinations thereof. MALDI, particularly UV and IR, are
among exemplary formats.

As used herein, the GPI-1 gene is generically used to include the
human GPI-1 gene and its homologs from rat, mouse, guinea pig
and other mammalian species. As described below, the GPI-1 gene encodes
a component of the GlcNAc transferase activity complex that functions
in the biosynthesis of glycosylphosphatidylinositol (GPI) anchor. Four
mammalian gene products (PIG-A, PIG-H, PIG-C and GPI-1) form a protein
complex that is responsible for the transferase enzyme activity in the
biosynthesis reaction. PIG-A, PIG-H and PIG-C are required for the first step
in GPI anchor biosynthesis; GPI-1 is not. Stabilization of the enzyme
complex, rather than participation in GlcNAc transfer, has been suggested
as a possible role for GPI-1 (Watanabe et al. EMBO 17:877, 1998).
B. Cytochrome c oxidase VIb gene

Cytochrome c oxidase (COX) is a mitochondrial enzyme complex
integrated in the inner membrane. It transfers electrons from cytochrome c
to molecular oxygen in the terminal reaction of the respiratory chain in
eukaryotic cells. COX consists of three large subunits encoded by the
mitochondrial genome and 10 other subunits encoded by nuclear genes.
The three subunits encoded by the mitochondrial genome are responsible for
the catalytic activity. The cytochrome c oxidase subunit VIb (COX6B) is
one of the nuclear gene products. The function of the nuclear-encoded
subunits is unknown. One proposed role is in the regulation of catalytic
activity, specifically the rate of electron transport and stoichiometry of
proton pumping. Other proposed roles are not directly related to electron
transport and include energy-dependent calcium uptake and protein import
by the mitochondrion. Proteolytic removal of subunits VIa and VIb has
been associated with loss of calcium transport in reconstituted vesicles.
Steady-state levels of the COX6B transcript differ among
tissues (Taanman et al., Gene (1990), 93:285). The COX6B gene
includes the human COX6B gene and its homologs from rat, mouse,
guinea pig, and any species that has a homologous gene.

Several single nucleotide polymorphisms have been identified in the
human COX6B gene. One of these is located at position 86 and is a C to
T transition that is manifested as a silent mutation in the coding region,
ACC to ACT (threonine to threonine) (SEQ ID NO.: 2). Although this is a
silent mutation at the amino acid level, it may represent an alteration that
changes codon usage, or it may affect mRNA stability, or it may be in
linkage disequilibrium with a non-silent change. Other known single
nucleotide polymorphisms of the COX6B gene include, but are not limited
to, those listed in Table 1.

TABLE 1

Gene                          GenBank Accession No.   SNP       SNP Location
COX6B                         NM_001863               C/T       86
(SEQ ID NO.: 1)                                       A/G       60
                                                      A/T       324
                                                      A/T       123

Based on methods disclosed herein and those used in the art, one
of skill would be able to use all the SNPs described and find additional
polymorphic regions of the COX6B gene to determine whether allelic
variants of these regions are associated with high cholesterol levels and
cardiovascular disease.
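
By way of illustration only, the statement that the ACC to ACT change at position 86 of COX6B is silent can be checked against the standard genetic code. The following is a minimal sketch, not part of the disclosed methods; the function name `is_silent` and the table construction are assumptions introduced here.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def is_silent(ref_codon: str, alt_codon: str) -> bool:
    """Return True if both codons encode the same amino acid."""
    return CODON_TABLE[ref_codon.upper()] == CODON_TABLE[alt_codon.upper()]


# Codons taken from the text: ACC and ACT both encode threonine (T).
print(CODON_TABLE["ACC"], CODON_TABLE["ACT"], is_silent("ACC", "ACT"))
```

A silent result says nothing about codon usage, mRNA stability or linkage disequilibrium, which the text identifies as the possible routes by which such a change could still matter.
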
C. GPI-1 Gene 

Glycosylphosphatidylinositol (GPI) functions to anchor various
eukaryotic proteins to membranes and is essential for their surface
expression. Thus, a defect in GPI anchor synthesis affects various
functions of cells, tissues and organs. Biosynthesis of
glycosylphosphatidylinositol (GPI) is initiated by the transfer of
N-acetylglucosamine (GlcNAc) from UDP-GlcNAc to phosphatidylinositol (PI)
and is catalyzed by a GlcNAc transferase, GPI-GlcNAc transferase
(GPI-GnT). Four mammalian gene products form a protein complex that is
responsible for this enzyme activity (PIG-A, PIG-H, PIG-C and GPI-1).
PIG-A, PIG-H and PIG-C are required for the first step in GPI anchor
biosynthesis; GPI-1 is not. Stabilization of the enzyme complex, rather
than participation in GlcNAc transfer, has been suggested as a possible
role for GPI-1 (Watanabe et al. EMBO 17:877, 1998).

A polymorphism has been identified at position 2577 of the human
GPI-1 gene. This is a G to A transition. This SNP is located in the 3'
untranslated region of the mRNA and does not affect protein structure,
but may affect mRNA stability or may be in linkage disequilibrium with a
non-silent change. Other known single nucleotide polymorphisms of the
GPI-1 gene include, but are not limited to, those listed in Table 2.


WO 02/072604 



PCT/US02/06728 




TABLE 2

Gene                          GenBank Accession No.   SNP       SNP Location
GPI-1                         NM_004204               C/T       2829
(SEQ ID NOS.: 6, 7)                                   A/G       2577
                                                      C/T       2519
                                                      C/T       2289
                                                      C/T       1938
                                                      C/G       1563
                                                      A/G/C/T   2664
                                                      A/G       2656
                                                      A/C/T     2167
                                                      G/C/A     2166


Based on methods disclosed herein and those used in the art, one
of skill would be able to use all the described SNPs and find additional
polymorphic regions of the GPI-1 gene to determine whether allelic
variants of these regions are associated with low levels of HDL and
cardiovascular disease.

D. Other genes and polymorphisms associated with cardiovascular
disease

Many other genes and polymorphisms contained within them have
been associated with risk factors for cardiovascular disease (aberrations
in lipid metabolism, specifically high levels of serum cholesterol and low
levels of HDL, and other such indicators) and/or the clinical phenotypes of
atherosclerosis and cardiovascular disease. Table 3 presents a list of
some of these genes and some associated polymorphisms (SNPs):
cholesterol ester transfer protein, plasma (CETP); apolipoprotein A-IV
(APO A4); apolipoprotein A-I (APO A1); apolipoprotein E (APO E);
apolipoprotein B (APO B); apolipoprotein C-III (APO C3); a gene encoding
lipoprotein lipase (LPL); ATP-binding cassette transporter (ABC 1);
paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding hepatic
lipase (LIPC); E-selectin; G protein beta 3 subunit and angiotensin II type
1 receptor gene. The SNP locations are based on the GenBank sequence.
Table 3 is not meant to be exhaustive, as one of skill in the art based on
the disclosure would be able to readily use other known polymorphisms in
these and other genes, new polymorphisms discovered in previously
identified genes and newly identified genes and polymorphisms in the
methods and compositions disclosed herein.


TABLE 3

Gene                          GenBank Accession No.   SNP       SNP Location
CETP                          NM_000078               C/A       991
(SEQ ID NOS.: 11, 12)                                 C/T       196
                                                      A/G       1586
                                                      A/G       1394
                                                      A/G       1439
                                                      C/G       1297
                                                      C/T       766
                                                      G/A       1131
                                                      G/A       1696
LPL                           NM_000237               A/G       1127
(SEQ ID NOS.: 13, 14)                                 A/C       3447
                                                      C/T       1973
                                                      C/T       3343
                                                      G/A       2851
                                                      C/T       3272
                                                      A/T       2428
                                                      T/C       2743
                                                      G/A       1453
                                                      C/A       3449
                                                      G/A       1282
                                                      G/A       579
                                                      A/C       1338
                                                      A/G/T/C   2416-2426
                                                      A/G       2427
                                                      C/T       1302
                                                      G/A       609
TABLE 3 (continued)

Gene                          GenBank Accession No.   SNP       SNP Location
[illegible in source: "1 £QR"; "P/T"; "C/T"]
                                                      G/A       280
                                                      G/A       1036
APO A4                        [illegible]             G/T       1122
(SEQ ID NOS.: 15, 16)                                 G/C       1033
                                                      G/A       1002
                                                      C/T       960
                                                      C/T       894
                                                      G/A       554
                                                      G/A       950
                                                      T/C       336
                                                      G/A       334
                                                      C/T       330
                                                      A/G       201
                                                      A/G       16
                                                      A/T       1213
APO E                         NM_000041               C/T       448
(SEQ ID NOS.: 17, 18)                                 G/A       448
(mRNA)                                                C/T       586
                                                      C/T       197
                                                      C/T       540
Hepatic Lipase                NM_000236               C/G       680
(SEQ ID NOS.: 19, 20)                                 G/A       1374
                                                      G/A       701
                                                      C/A       1492
                                                      A/G       648
                                                      G/C       729
                                                      G/A       340
                                                      G/T       522
PON 1                         NM_000446               A/T       172
(SEQ ID NOS.: 21, 22)                                 A/G       584
                                                      G/C       190
TABLE 3 (continued)

Gene                          GenBank Accession No.   SNP       SNP Location
PON 2                         NM_004947               C/G       475
(SEQ ID NOS.: 23, 24)                                 C/G       964
APO C3                        NM_000040               C/T       148
(SEQ ID NOS.: 25, 26)                                 T/A       471
                                                      G/C       386
                                                      G/T       417
                                                      T/A       495
ABC 1                         NM_005567               G/A       8591
(SEQ ID NOS.: 27, 28)
APO A1                        NM_000039               C/G       770
(SEQ ID NOS.: 29, 30)                                 G/A       656
                                                      C/G       589
                                                      C/G       414
                                                      A/T       430
                                                      C/T       708
                                                      C/T       221
                                                      T/G       223
                                                      C/T       597
                                                      A/G       340
                                                      G/C       690
APO B                         NM_000384               A/G/C/T   13141
(SEQ ID NOS.: 31, 32)                                 A/G/C/T   12669
                                                      C/T       11323
                                                      G/C       10422
                                                      A/C       10408
                                                      C/G       10083
                                                      C/T       7064
                                                      C/T       6666
                                                      C/T       1980
                                                      C/G       5751
                                                      C/T       7673
                                                      C/A/G/T   8344
                                                      G/C/T/A   4393
                                                      A/C/T/G   5894
                                                      A/T       12019
                                                      C/T       11973
TABLE 3 (continued)

Gene                          GenBank Accession No.   SNP       SNP Location
APO B (con't)                                         G/C/T/A   7065
                                                      C/G       947
                                                      C/G       7331
                                                      A/G       7221
                                                      G/C       6402
                                                      G/C       3780
                                                      C/G       1661
                                                      A/T       8167
                                                      C/A       8126
                                                      C/T       421
                                                      C/T       1981
                                                      G/A       12510
                                                      G/C       12937
                                                      G/A       11042
                                                      C/T       2834
                                                      A/G       5869
                                                      A/G       11962
                                                      C/G       4439
                                                      G/A       7824
                                                      G/A       13569
                                                      G/A       9489
                                                      G/A       2325
                                                      G/A       10259
                                                      C/G       14
MTHFR                         NM_005957               G/A       5442
(SEQ ID NOS.: 33, 34)                                 A/G       5113
                                                      A/G       5113
                                                      A/G       5110
                                                      A/G       5102
                                                      A/C/T     5097
                                                      A/C/T     5097
                                                      C/T       5079
                                                      C/T       5079
                                                      T/C       5071
                                                      T/C       5071
                                                      T/C       5051
TABLE 3 (continued)

Gene                          GenBank Accession No.   SNP       SNP Location
MTHFR (con't)                                         G/A       5012
                                                      C/A       5000
                                                      A/G       4998
                                                      A/G       4994
                                                      A/G       4994
                                                      A/G       4994
                                                      C/T       4991
                                                      C/T       4991
                                                      C/T       4991
                                                      A/G       4986
                                                      A/G       4986
                                                      A/G       4986
                                                      C/T       4985
                                                      T/A       4982
                                                      T/G       4981
                                                      T/C       4981
                                                      T/C       4981
                                                      G/C/A     4967
                                                      G/A       4963
                                                      A/G       4962
                                                      G/C/T     4962
                                                      A/C/G/T   4961
                                                      A/C/T     4961
                                                      A/C       4961
                                                      A/C       4961
                                                      A/C/T     4960
                                                      T/C       4938
                                                      T/C       4937
                                                      T/C       4933
                                                      G/C/T     4933
                                                      C/T       4929
                                                      C/T       4929
                                                      T/A/G     4929
                                                      A/G       4928
                                                      G/C       4928
                                                      C/G       4927
TABLE 3 (continued)

Gene                          GenBank Accession No.   SNP       SNP Location
MTHFR (con't)                                         G/A       4923
                                                      C/T       4919
                                                      A/T/G     4913
                                                      C/T       4912
                                                      A/T       4903
                                                      C/T       4902
                                                      A/G       4900
                                                      G/A       4898
                                                      G/T       4898
                                                      C/T       4897
                                                      G/T       4894
                                                      T/C/G     4836
                                                      C/T       3862
                                                      C/T       4922
                                                      C/T       4959
                                                      T/C       4981
                                                      A/G       4994
                                                      A/G       5044
                                                      T/C       5051
                                                      G/C       5066
                                                      C/T       5079
                                                      C/A/G     5085
                                                      C/T       5092
                                                      A/G       5103
                                                      A/G       5113
                                                      C/T       1021
E-Selectin                    NM_000450               G/A       3484
(SEQ ID NOS.: 35, 36)                                 G/A       3093
                                                      T/G       2939
                                                      T/C       2902
                                                      C/T       1937
                                                      C/T       1916
                                                      C/T       1839
                                                      C/T       1805
                                                      C/T       1518
                                                      G/C       1377
TABLE 3 (continued)

Gene                          GenBank Accession No.   SNP       SNP Location
E-Selectin (con't)                                    C/T       1376
                                                      G/A       999
                                                      T/C       857
                                                      A/C       561
                                                      C/G       506
                                                      A/G       392
                                                      G/T       98
G protein β3 subunit          NM_002075               C/T       1828
(SEQ ID NOS.: 37, 38)                                 C/T       1546
                                                      G/T       1431
                                                      G/A       1231
                                                      C/T       1230
Angiotensin II type 1         NM_00686                G/A       1453
receptor gene                                         C/G       968
(SEQ ID NOS.: 39, 40)                                 G/C       966
                                                      T/C       941
                                                      G/A       894
                                                      T/C       659


Assays to identify the nucleotide present at the polymorphic site
include those described herein and all others known to those who practice
the art.

For some of the SNPs described above, there is provided a
description of the MassEXTEND™ reaction components that can be used
to determine the allelic variant that is present. Included are the forward
and reverse primers used for amplification. Also included are the
MassEXTEND™ primer used in the primer extension reaction and the
extended MassEXTEND™ primers for each allele. MassEXTEND™
reactions are carried out and the products analyzed as described in
Examples 2 and 3.
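
By way of illustration only, the primer and extended-primer masses listed below can be approximated from base composition. The following is a minimal sketch under stated assumptions, not the method by which the listed masses were obtained: the average residue masses, the terminal correction for an unphosphorylated oligonucleotide, and the 16 Da reduction for a 3'-terminal dideoxynucleotide are commonly used approximations introduced here, and `oligo_mass` is a hypothetical helper.

```python
# Average masses (Da) of deoxynucleotide residues within a DNA chain (assumed values).
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
TERMINAL_CORRECTION = -61.96  # oligo with 5'-OH and 3'-OH, no terminal phosphate
DIDEOXY_CORRECTION = -16.00   # a 3'-terminal dideoxynucleotide lacks one oxygen


def oligo_mass(seq: str, terminal_dideoxy: bool = False) -> float:
    """Approximate average mass (Da) of a single-stranded DNA oligonucleotide."""
    mass = sum(RESIDUE_MASS[base] for base in seq.upper()) + TERMINAL_CORRECTION
    if terminal_dideoxy:
        mass += DIDEOXY_CORRECTION
    return round(mass, 1)


# CETP position 991 (C/A), sequences taken from the listing that follows.
primer = "ATGCCTGCTCCAAAGGCAC"                                       # listed mass 5757.8
allele_c = oligo_mass("ATGCCTGCTCCAAAGGCACC", terminal_dideoxy=True)   # listed mass 6030.9
allele_a = oligo_mass("ATGCCTGCTCCAAAGGCACAT", terminal_dideoxy=True)  # listed mass 6359.2
print(oligo_mass(primer), allele_c, allele_a)
```

Under these assumptions the computed values agree with the listed CETP position 991 masses to within about 0.1 Da, and the roughly 328 Da separation between the two extended products is what allows the alleles to be resolved by mass.
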
CETP

Position 991 (C/A)
PCR primers:
  Forward:                  ACTGCCTGATAACCATGCTG (SEQ ID NO.: 41)
  Reverse:                  ATACTTACACACCAGGAGGG (SEQ ID NO.: 42)
MassEXTEND™ Primer:         ATGCCTGCTCCAAAGGCAC (SEQ ID NO.: 43)
Primer Mass:                5757.8
Extended Primer-Allele C:   ATGCCTGCTCCAAAGGCACC (SEQ ID NO.: 44)
Extended Primer Mass:       6030.9
Extended Primer-Allele A:   ATGCCTGCTCCAAAGGCACAT (SEQ ID NO.: 45)
Extended Primer Mass:       6359.2

Position 196 (C/T)
PCR primers:
  Forward:                  TACTTCTGGTTCTCTGAGCG (SEQ ID NO.: 46)
  Reverse:                  ACTCACCTTGAACTCGTCTC (SEQ ID NO.: 47)
MassEXTEND™ Primer:         TGGTTCTCTGAGCGAGTCTT (SEQ ID NO.: 48)
Primer Mass:                6130
Extended Primer-Allele C:   TGGTTCTCTGAGCGAGTCTTC
                            (SEQ ID NO.: 49)
Extended Primer Mass:       6707.4
Extended Primer-Allele T:   TGGTTCTCTGAGCGAGTCTTTC (SEQ ID NO.: 50)
Extended Primer Mass:       6333.1

Position 1586 (A/G)
PCR primers:
  Forward:                  TGCAGATGGACTTTGGCTTC (SEQ ID NO.: 51)
  Reverse:                  TGCTTGCCTTCTGCTACAAG (SEQ ID NO.: 52)
MassEXTEND™ Primer:         CTTCCCTGAGCACCTGCTG (SEQ ID NO.: 53)
Primer Mass:                5715.7
Extended Primer-Allele G:   CTTCCCTGAGCACCTGCTGGT (SEQ ID NO.: 54)
Extended Primer Mass:       6333.1
Extended Primer-Allele A:   CTTCCCTGAGCACCTGCTGA (SEQ ID NO.: 55)
Extended Primer Mass:       6012.9

APOA4

Position 1122 (G/T)
PCR primers:
  Forward:                  AACAGCTCAGGACGAAACTG (SEQ ID NO.: 56)
  Reverse:                  AGAAGGAGTTGACCTTGTCC (SEQ ID NO.: 57)
MassEXTEND™ Primer:         GGAAGCTCAAGTGGCCTTC (SEQ ID NO.: 58)
Primer Mass:                5828.8
Extended Primer-Allele G:   GGAAGCTCAAGTGGCCTTCC (SEQ ID NO.: 59)
Extended Primer Mass:       6102.0
Extended Primer-Allele T:   GGAAGCTCAAGTGGCCTTCAAC (SEQ ID NO.: 60)
Extended Primer Mass:       6728.4

Position 1033 (G/C)
PCR primers:
  Forward:                  AAGTCACTGGCAGAGCTGG (SEQ ID NO.: 61)
  Reverse:                  GCACCAGGGCTTTGTTGAAG (SEQ ID NO.: 62)
MassEXTEND™ Primer:         TTTTCCCCGTAGGGCTCCA (SEQ ID NO.: 63)
Primer Mass:                5730.7
Extended Primer-Allele G:   TTTTCCCCGTAGGGCTCCAC (SEQ ID NO.: 64)
Extended Primer Mass:       6003.9
Extended Primer-Allele C:   TTTTCCCCGTAGGGCTCCAGC (SEQ ID NO.: 65)
Extended Primer Mass:       6333.1

Position 1002 (G/A)
PCR primers:
  Forward:                  TGCAGAAGTCACTGGCAGAG (SEQ ID NO.: 66)
  Reverse:                  GTTGAAGTTTTCCCCGTAGG (SEQ ID NO.: 67)
MassEXTEND™ Primer:         ACTCCTCCACCTGCTGGTC (SEQ ID NO.: 68)
Primer Mass:                5675.7
Extended Primer-Allele G:   ACTCCTCCACCTGCTGGTCC (SEQ ID NO.: 69)
Extended Primer Mass:       5948.9
Extended Primer-Allele A:   ACTCCTCCACCTGCTGGTCTA (SEQ ID NO.: 70)
Extended Primer Mass:       6277.1

Position 960 (C/T)
PCR primers:
  Forward:                  AGGACGTGCGTGGCAACCTG (SEQ ID NO.: 71)
  Reverse:                  AGCTCTGCCAGTGACTTCTG (SEQ ID NO.: 72)
MassEXTEND™ Primer:         GTGACTTCTGCAGCCCCTC (SEQ ID NO.: 73)
Primer Mass:                5715.7
Extended Primer-Allele T:   GTGACTTCTGCAGCCCCTCA (SEQ ID NO.: 74)
Extended Primer Mass:       6012.9
Extended Primer-Allele C:   GTGACTTCTGCAGCCCCTCGGT (SEQ ID NO.: 75)
Extended Primer Mass:       6662.3

Position 894 (C/T)
PCR primers:
  Forward:                  CCTGACCTTCCAGATGAAG (SEQ ID NO.: 76)
  Reverse:                  TCAGGTTGCCACGCACGTC (SEQ ID NO.: 77)
MassEXTEND™ Primer:         CAGGATCTCGGCCAGTGC (SEQ ID NO.: 78)
Primer Mass:                5500.6
Extended Primer-Allele C:   CAGGATCTCGGCCAGTGCC (SEQ ID NO.: 79)
Extended Primer Mass:       5773.8
Extended Primer-Allele T:   CAGGATCTCGGCCAGTGCTG (SEQ ID NO.: 80)
Extended Primer Mass:       6118.0

Position 554 (G/A)
PCR primers:
  Forward:                  ACCTGCGAGAGCTTCAGCAG (SEQ ID NO.: 81)
  Reverse:                  TCTCCATGCGCTGTGCGTAG (SEQ ID NO.: 82)
MassEXTEND™ Primer:         AGCTGCGCACCCAGGTCA (SEQ ID NO.: 83)


Primer Mass:                5469.6
Extended Primer-Allele A:   AGCTGCGCACCCAGGTCAA (SEQ ID NO.: 84)
Extended Primer Mass:       5766.8
Extended Primer-Allele G:   AGCTGCGCACCCAGGTCAGC (SEQ ID NO.: 85)
Extended Primer Mass:       6072.0

APOE

Position 448 (C/T)
PCR primers:
  Forward:                  TGTCCAAGGAGCTGCAGGC (SEQ ID NO.: 86)
  Reverse:                  CTTACGCAGCTTGCGCAGGT (SEQ ID NO.: 87)
MassEXTEND™ Primer:         GCGGACATGGAGGACGTG (SEQ ID NO.: 88)
Primer Mass:                5629.7
Extended Primer-Allele C:   GCGGACATGGAGGACGTGC (SEQ ID NO.: 89)
Extended Primer Mass:       5902.8
Extended Primer-Allele T:   GCGGACATGGAGGACGTGTG (SEQ ID NO.: 90)
Extended Primer Mass:       6247.1

LPL

Position 1127 (A/G)
PCR primers:
  Forward:                  GTTGTAGAAAGAACCGCTGC (SEQ ID NO.: 91)
  Reverse:                  GAGAACGAGTCTTCAGGTAC (SEQ ID NO.: 92)
MassEXTEND™ Primer:         ACAATCTGGGCTATGAGATCA (SEQ ID NO.: 93)
Primer Mass:                6454.2
Extended Primer-Allele A:   ACAATCTGGGCTATGAGATCAA (SEQ ID NO.: 94)
Extended Primer Mass:       6751.4
Extended Primer-Allele G:   ACAATCTGGGCTATGAGATCAGT (SEQ ID NO.: 95)
Extended Primer Mass:       7071.6

Position 3447 (A/C)
PCR primers:
  Forward:                  CACTCTACACTGCATGTCTC (SEQ ID NO.: 96)
  Reverse:                  ACCCTTCTGAAAAGGAGAGG (SEQ ID NO.: 97)
MassEXTEND™ Primer:         GAGGAGAGACAAGGCAGATA (SEQ ID NO.: 98)
Primer Mass:                6273.1
Extended Primer-Allele A:   GAGGAGAGACAAGGCAGATAT (SEQ ID NO.: 99)


Extended Primer Mass:       6561.3
Extended Primer-Allele C:   GAGGAGAGACAAGGCAGATAGT (SEQ ID NO.: 100)
Extended Primer Mass:       6890.5

Position 1973 (C/T)
PCR primers:
  Forward:                  AAAGGTTCAGTTGCTGCTGC (SEQ ID NO.: 101)
  Reverse:                  GCTGGGGAAGGTCTAATAAC (SEQ ID NO.: 102)
MassEXTEND™ Primer:         GTTGCTGCTGCCTCGAATC (SEQ ID NO.: 103)
Primer Mass:                5770.7
Extended Primer-Allele C:   GTTGCTGCTGCCTCGAATCC (SEQ ID NO.: 104)
Extended Primer Mass:       6043.9
Extended Primer-Allele T:   GTTGCTGCTGCCTCGAATCTG (SEQ ID NO.: 105)
Extended Primer Mass:       6388.2

LIPC

Position 680 (C/G)
PCR primers:
  Forward:                  CGTCTTTCTCCAGATGATGC (SEQ ID NO.: 106)
  Reverse:                  AGTGTCCTATGGGCTGTTTG (SEQ ID NO.: 107)
MassEXTEND™ Primer:         GGATGCCATTCATACCTTTAC
                            (SEQ ID NO.: 108)
Primer Mass:                6556.1
Extended Primer-Allele C:   GGATGCCATTCATACCTTTACC (SEQ ID NO.: 109)
Extended Primer Mass:       6629.3
Extended Primer-Allele G:   GGATGCCATTCATACCTTTACGC (SEQ ID NO.: 110)
Extended Primer Mass:       6958.5

Position 1374 (G/A)
PCR primers:
  Forward:                  TGGGAAAACAGTGCAGTGTG (SEQ ID NO.: 111)
  Reverse:                  TGATCGTCTTCAGAACGAGG (SEQ ID NO.: 112)
MassEXTEND™ Primer:         CCAGACCATCATCCCATGGA (SEQ ID NO.: 113)
Primer Mass:                6030.9
Extended Primer-Allele A:   CCAGACCATCATCCCATGGAA (SEQ ID NO.: 114)
Extended Primer Mass:       6328.1
Extended Primer-Allele G:   CCAGACCATCATCCCATGGAGC (SEQ ID NO.: 115)
Extended Primer Mass:       6633.3

Position 701 (G/A)
PCR primers:
  Forward:                  CAGCAATCGTCTTTCTCCAG (SEQ ID NO.: 116)
  Reverse:                  TCCTATGGGCTGTTTGATGC (SEQ ID NO.: 117)
MassEXTEND™ Primer:         GTCTTTCTCCAGATGATGCCA (SEQ ID NO.: 118)
Primer Mass:                6372.2
Extended Primer-Allele A:   GTCTTTCTCCAGATGATGCCAA (SEQ ID NO.: 119)
Extended Primer Mass:       6669.4
Extended Primer-Allele G:   GTCTTTCTCCAGATGATGCCAGT (SEQ ID NO.: 120)
Extended Primer Mass:       6989.6


E. Databases

Databases for determining an association between polymorphic
regions of genes and intermediate and clinical phenotypes contain
biological samples (e.g., blood) that provide a source of nucleic acid and
clinical data covering diseases (e.g., age, sex, ethnicity, medical history
and family medical history) from both individuals exhibiting the phenotype
(intermediate phenotype (risk factor) or clinical phenotype (disease)) and
those who do not. These databases include human population groups
such as twins, diverse affected families, isolated founder populations and
drug trial subjects. The quality and consistency of the clinical resources
are of primary importance.

F. Association Studies

The examples set forth below used an extreme trait analysis to
discover an association between an allelic variant of the COX6B gene and
high cholesterol and an association between an allelic variant of the GPI-1
gene and low HDL. This analysis is based on comparing a pair of pools of
DNA from individuals who exhibit, respectively, hypo- or hypernormal levels
of a biochemical trait (e.g., cholesterol or HDL) and individually examining
SNPs for a difference in allelic frequency between the pools. An
association is considered to be positive if a statistically significant value
of at least 3.841 using a 1-degree-of-freedom chi-squared test of
association, p = 0.05, is obtained. Standard multiple testing corrections
are applied if more than one SNP is considered at a time, i.e., multiple
SNPs are tested during the same study. Although not always required, it
may be necessary to further examine the frequency of allelic variants in
other populations, including those exhibiting normal levels of the given
trait.
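
By way of illustration only, the 1-degree-of-freedom test described above can be sketched for a single SNP as a Pearson chi-squared statistic on a 2x2 table of allele counts in the two pools. This is a sketch under stated assumptions: the allele counts are hypothetical, no continuity correction is applied, and a Bonferroni adjustment stands in for the unspecified "standard multiple testing corrections".

```python
import math

CRITICAL_VALUE_1DF_P05 = 3.841  # threshold quoted in the text for p = 0.05


def chi_squared_2x2(a1_low: int, a2_low: int, a1_high: int, a2_high: int) -> float:
    """Pearson chi-squared (1 df) for allele counts in the low and high pools."""
    n = a1_low + a2_low + a1_high + a2_high
    denom = (a1_low + a2_low) * (a1_high + a2_high) * (a1_low + a1_high) * (a2_low + a2_high)
    if denom == 0:
        raise ValueError("each pool and each allele must have a nonzero count")
    return n * (a1_low * a2_high - a2_low * a1_high) ** 2 / denom


def p_value_1df(chi2: float) -> float:
    """Upper-tail probability of a chi-squared variate with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2.0))


def is_positive_association(chi2: float, n_snps_tested: int = 1, alpha: float = 0.05) -> bool:
    """Apply the quoted threshold for one SNP, or a Bonferroni cut-off for several."""
    if n_snps_tested == 1:
        return chi2 >= CRITICAL_VALUE_1DF_P05
    return p_value_1df(chi2) <= alpha / n_snps_tested


# Hypothetical counts: allele 1 / allele 2 in the low pool, then in the high pool.
chi2 = chi_squared_2x2(60, 40, 40, 60)
print(chi2, p_value_1df(chi2), is_positive_association(chi2))
```

When many SNPs are screened in the same study, the per-SNP threshold tightens accordingly, which is why a value just above 3.841 may no longer count as a positive association.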

For a qualitative trait (e.g., hypertension), association studies are
based on determining the occurrence of certain alleles in a given
population of diseased vs. healthy individuals.

Allelic variants of COX6B, GPI-1 and other genes found to
associate with high cholesterol, low HDL and/or cardiovascular disease
can represent useful markers for indicating a predisposition for developing
a risk factor for cardiovascular disease. These allelic variants may not
necessarily represent functional variants affecting the expression,
stability, or activity of the encoded protein product. Those of skill in the
art would be able to determine which allelic variants are to be used, alone
or in conjunction with other variants, only for indicating a predisposition
for cardiovascular disease or for profiling of drug reactivity and for
determining those that may be also useful for screening for potential
therapeutics.

Any method used to determine association can be used to discover
or confirm the association of other polymorphic regions in the COX6B
gene, the GPI-1 gene or any other gene that may be associated with
cardiovascular disease.

G. Detection of Polymorphisms

1. Nucleic acid detection methods

Generally, these methods are based on sequence-specific
polynucleotides, oligonucleotides, probes and primers. Any method
known to those of skill in the art for detecting a specific nucleotide within
a nucleic acid sequence or for determining the identity of a specific
nucleotide in a nucleic acid sequence is applicable to the methods of
determining the presence or absence of an allelic variant of a COX6B
gene or GPI-1 gene or another gene associated with cardiovascular
disease. Such methods include, but are not limited to, techniques
utilizing nucleic acid hybridization of sequence-specific probes, nucleic
acid sequencing, selective amplification, analysis of restriction enzyme
digests of the nucleic acid, cleavage of mismatched heteroduplexes of
nucleic acid and probe, alterations of electrophoretic mobility, primer
specific extension, oligonucleotide ligation assay and single-stranded
conformation polymorphism analysis. In particular, primer extension
reactions that specifically terminate by incorporating a dideoxynucleotide
are useful for detection. Several such general nucleic acid detection
assays are described in U.S. Patent No. 6,030,778.

a. Primer extension-based methods

Several primer extension-based methods for determining the
identity of a particular nucleotide in a nucleic acid sequence have been
reported (see, e.g., PCT Application No. PCT/US96/03651
(WO 96/29431), PCT Application No. PCT/US97/20444 (WO 98/20019),
PCT Application No. PCT/US91/00046 (WO 91/13075), and U.S. Patent
No. 5,856,092). In general, a primer is prepared that specifically
hybridizes adjacent to a polymorphic site in a particular nucleic acid
sequence. The primer is then extended in the presence of one or more
dideoxynucleotides, typically with at least one of the dideoxynucleotides
being the complement of the nucleotide that is polymorphic at the site.
The primer and/or the dideoxynucleotides may be labeled to facilitate a
determination of primer extension and identity of the extended nucleotide.
In one method, primer extension and/or the identity of the extended
nucleotide(s) are determined by mass spectrometry (see, e.g., PCT
Application Nos. PCT/US96/03651 (WO 96/29431) and PCT/US97/20444
(WO 98/20019)).
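
By way of illustration only, terminated primer extension can be sketched as appending bases to the primer until a base supplied as a dideoxynucleotide is incorporated. This is a simplified model, not the disclosed reaction conditions: the downstream bases are written in the sense of the primer strand, and the choice of terminators (ddC, ddG, ddT with dATP) and the downstream sequences are assumptions chosen so that the products match the CETP position 991 extended primers listed above.

```python
def extend_primer(primer: str, downstream: str, terminators: set) -> str:
    """Extend `primer` along `downstream` until a terminator base is added.

    `downstream` holds the bases that would be added to the primer's 3' end,
    written in primer-strand sense; `terminators` are the bases supplied as
    dideoxynucleotides.
    """
    extended = primer
    for base in downstream:
        extended += base
        if base in terminators:
            break  # a dideoxynucleotide blocks further extension
    return extended


primer = "ATGCCTGCTCCAAAGGCAC"
terminators = {"C", "G", "T"}  # assumed termination mix; A is supplied as dATP

# Assumed downstream bases for each allele at the polymorphic site.
allele_c_product = extend_primer(primer, "CT", terminators)
allele_a_product = extend_primer(primer, "AT", terminators)
print(allele_c_product, allele_a_product)
```

The two products differ in length and composition, so they can be distinguished by any readout sensitive to size or mass, including the mass spectrometric detection mentioned above.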

b. Polymorphism-specific probe hybridization

One exemplary detection method is allele specific hybridization
using probes overlapping the polymorphic site and having about 5, 10,
15, 20, 25, or 30 nucleotides around the polymorphic region. The probes
can contain naturally occurring or modified nucleotides (see U.S. Patent
No. 6,156,501). For example, oligonucleotide probes may be prepared in
which the known polymorphic nucleotide is placed centrally (allele-specific
probes) and then hybridized to target DNA under conditions that
permit hybridization only if a perfect match is found (Saiki et al. (1986)
Nature 324:163; Saiki et al. (1989) Proc. Natl. Acad. Sci. USA 86:6230;
and Wallace et al. (1979) Nucl. Acids Res. 6:3543). Such allele specific
oligonucleotide hybridization techniques may be used for the simultaneous
detection of several nucleotide changes in different polymorphic regions.
For example, oligonucleotides having nucleotide sequences of specific
allelic variants are attached to a hybridizing membrane and this membrane
is then hybridized with labeled sample nucleic acid. Analysis of the
hybridization signal will then reveal the identity of the nucleotides of the
sample nucleic acid. In one embodiment, several probes capable of
hybridizing specifically to allelic variants are attached to a solid phase
support, e.g., a "chip". Oligonucleotides can be bound to a solid support
by a variety of processes, including lithography. For example, a chip can
hold up to 250,000 oligonucleotides (GeneChip, Affymetrix, Santa Clara,
CA). Mutation detection analysis using these chips comprising
oligonucleotides, also termed "DNA probe arrays", is described, e.g., in
Cronin et al. (1996) Human Mutation 7:244 and in Kozal et al. (1996)
Nature Medicine 2:753. In one embodiment, a chip includes all the allelic
variants of at least one polymorphic region of a gene. The solid phase
support is then contacted with a test nucleic acid and hybridization to the
specific probes is detected. Accordingly, the identity of numerous allelic
variants of one or more genes can be identified in a simple hybridization
experiment.
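
By way of illustration only, the design of allele-specific probes with the polymorphic nucleotide placed centrally can be sketched as follows. The flanking sequences are hypothetical, `allele_specific_probes` and `perfect_match` are helper names introduced here, and hybridization is reduced to an exact same-strand substring match, which is a deliberate simplification of the stringency conditions described above.

```python
def allele_specific_probes(flank5: str, alleles: str, flank3: str, arm: int = 10) -> dict:
    """Return {allele: probe} with the polymorphic base at the probe center.

    `flank5` and `flank3` are the sequences immediately 5' and 3' of the
    polymorphic site; `arm` is the number of flanking bases kept on each side.
    """
    if arm < 1 or len(flank5) < arm or len(flank3) < arm:
        raise ValueError("arm must be at least 1 and no longer than either flank")
    left, right = flank5[-arm:], flank3[:arm]
    return {allele: left + allele + right for allele in alleles}


def perfect_match(probe: str, target: str) -> bool:
    """Idealized stringent hybridization: signal only on an exact match."""
    return probe in target


# Hypothetical flanks around a C/T site.
probes = allele_specific_probes("GATTACAGATTACA", "CT", "TGCATGCATGCA", arm=5)
sample = "GATTACAGATTACA" + "T" + "TGCATGCATGCA"  # sample carrying the T allele
genotype_signal = {allele: perfect_match(p, sample) for allele, p in probes.items()}
print(probes, genotype_signal)
```

Arraying one such probe per allele for each polymorphic region corresponds to the chip embodiment described above, in which the pattern of positive spots gives the alleles present in the sample.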

c. Nucleic acid amplification-based methods

In other detection methods, it is necessary to first amplify at least a
portion of a COX6B gene, GPI-1 gene or another gene associated with
cardiovascular disease prior to identifying the allelic variant. Amplification
can be performed, e.g., by PCR and/or LCR, according to methods known
in the art. In one embodiment, genomic DNA of a cell is exposed to two
PCR primers and amplification is performed for a number of cycles
sufficient to produce the required amount of amplified DNA. In certain
embodiments, the primers are located between 150 and 350 base pairs
apart.

Alternative amplification methods include: self sustained sequence
replication (Guatelli, J. C. et al. (1990) Proc. Natl. Acad. Sci. U.S.A.
87:1874-1878); transcriptional amplification system (Kwoh, D. Y. et al.
(1989) Proc. Natl. Acad. Sci. U.S.A. 86:1173-1177); Q-Beta Replicase
(Lizardi, P. M. et al. (1988) Bio/Technology 6:1197) and any other nucleic
acid amplification method, followed by the detection of the amplified
molecules using techniques well known to those of skill in the art. These
detection schemes are also useful for the detection of nucleic acid
molecules if such molecules are present in very low numbers.

Alternatively, allele specific amplification technology, which
depends on selective PCR amplification, may be used in conjunction with
the alleles provided herein. Oligonucleotides used as primers for specific
amplification may carry the allelic variant of interest in the center of the
molecule (so that amplification depends on differential hybridization)
(Gibbs et al. (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme
3' end of one primer where, under appropriate conditions, mismatch can
prevent or reduce polymerase extension (Prossner (1993) Tibtech
11:238; Newton et al. (1989) Nucl. Acids Res. 17:2503). In addition, it
may be desirable to introduce a restriction site in the region of the
mutation to create cleavage-based detection (Gasparini et al. (1992) Mol.
Cell Probes 6:1).

d. Nucleic acid sequencing-based methods

In one embodiment, any of a variety of sequencing reactions
known in the art can be used to directly sequence at least a portion of the
COX6B gene, GPI-1 gene or other gene associated with cardiovascular
disease and to detect allelic variants, e.g., mutations, by comparing the
sequence of the sample with the corresponding wild-type
(control) sequence. Exemplary sequencing reactions include those based
on techniques developed by Maxam and Gilbert (Proc. Natl. Acad. Sci.
USA (1977) 74:560) or Sanger (Sanger et al. (1977) Proc. Natl. Acad. Sci.
74:5463). It is also contemplated that any of a variety of automated
sequencing procedures may be used when performing the subject assays
(Biotechniques (1995) 19:448), including sequencing by mass
spectrometry (see, for example, U.S. Patent No. 5,547,835 and
International PCT Application No. WO 94/16101, entitled "DNA
Sequencing by Mass Spectrometry" by H. Koster; U.S. Patent No.
5,547,835 and International PCT Application No. WO 94/21822, entitled
"DNA Sequencing by Mass Spectrometry Via Exonuclease Degradation"
by H. Koster; U.S. Pat. No. 5,605,798 and International Patent
Application No. PCT/US96/03651, entitled "DNA Diagnostics Based on
Mass Spectrometry" by H. Koster; Cohen et al. (1996) Adv Chromatogr
36:127-162; and Griffin et al. (1993) Appl Biochem Biotechnol
38:147-159). It will be evident to one skilled in the art that, for certain
embodiments, the occurrence of only one, two or three of the nucleic
acid bases need be determined in the sequencing reaction. For instance,
A-track sequencing or an equivalent, e.g., where only one nucleotide is
detected, can be carried out. Other sequencing methods are disclosed,
e.g., in U.S. Patent No. 5,580,732 entitled "Method of DNA sequencing
employing a mixed DNA-polymer chain probe" and U.S. Patent No.
5,571,676 entitled "Method for mismatch-directed in vitro DNA
sequencing".

e. Restriction enzyme digest analysis 
In some cases, the presence of a specific allele in nucleic acid,
particularly DNA, from a subject can be shown by restriction enzyme
analysis. For example, a specific nucleotide polymorphism can result in a
nucleotide sequence containing a restriction site that is absent from the
nucleotide sequence of another allelic variant.
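
As a minimal sketch of this idea, the following Python fragment checks two
allelic sequences for the presence of a recognition site. The two allele
sequences are hypothetical and chosen only for illustration; EcoRI (GAATTC)
is used merely as a familiar example of a restriction enzyme.

```python
def site_positions(seq, site):
    """Return the 0-based start position of every occurrence of `site`."""
    seq, site = seq.upper(), site.upper()
    return [i for i in range(len(seq) - len(site) + 1)
            if seq[i:i + len(site)] == site]


ECORI = "GAATTC"                 # EcoRI recognition sequence
allele_1 = "ACGTGAATTCTTGACC"    # hypothetical allele containing the site
allele_2 = "ACGTGAGTTCTTGACC"    # hypothetical allele; A>G change removes it

print(site_positions(allele_1, ECORI))  # [4]
print(site_positions(allele_2, ECORI))  # []
```

A fragment carrying the first allele would be cleaved and one carrying the
second would remain intact, so the digestion pattern distinguishes the two.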

f. Mismatch Cleavage
Protection from cleavage agents, such as, but not limited to, a
nuclease, or hydroxylamine or osmium tetroxide together with piperidine,
can be used to detect mismatched bases in RNA/RNA, DNA/DNA, or RNA/DNA
heteroduplexes (Myers et al. (1985) Science 230:1242). In general, the
technique of "mismatch cleavage" starts by providing heteroduplexes
formed by hybridizing a control nucleic acid, which is optionally labeled,
e.g., RNA or DNA, comprising a nucleotide sequence of an allelic variant
with a sample nucleic acid, e.g., RNA or DNA, obtained from a tissue
sample. The double-stranded duplexes are treated with an agent that
cleaves single-stranded regions of the duplex, such as those formed
by base pair mismatches between the control and sample strands.
For instance, RNA/DNA duplexes can be treated with RNase and
DNA/DNA hybrids treated with S1 nuclease to enzymatically digest the
mismatched regions.

In other embodiments, either DNA/DNA or RNA/DNA duplexes can
be treated with hydroxylamine or osmium tetroxide and with piperidine in
order to digest mismatched regions. After digestion of the mismatched
regions, the resulting material is then separated by size on denaturing
polyacrylamide gels to determine whether the control and sample nucleic
acids have an identical nucleotide sequence or in which nucleotides they
differ (see, for example, Cotton et al. (1988) Proc. Natl. Acad. Sci. USA
85:4397; Saleeba et al. (1992) Methods Enzymol. 217:286-295). The
control or sample nucleic acid is labeled for detection.

g. Electrophoretic mobility alterations

In other embodiments, alteration in electrophoretic mobility is used
to identify the type of allelic variant in the COX6B gene, GPI-1 gene or
other gene associated with cardiovascular disease. For example,
single-strand conformation polymorphism (SSCP) may be used to detect
differences in electrophoretic mobility between mutant and wild type
nucleic acids (Orita et al. (1989) Proc. Natl. Acad. Sci. USA 86:2766; see
also Cotton (1993) Mutat Res 285:125-144; and Hayashi (1992) Genet
Anal Tech Appl 9:73-79). Single-stranded DNA fragments of sample and
control nucleic acids are denatured and allowed to renature. The
secondary structure of single-stranded nucleic acids varies according to
sequence; the resulting alteration in electrophoretic mobility enables the
detection of even a single base change. The DNA fragments may be
labeled or detected with labeled probes. The sensitivity of the assay may
be enhanced by using RNA (rather than DNA), in which the secondary
structure is more sensitive to a change in sequence. In another
embodiment, the subject method uses heteroduplex analysis to separate
double stranded heteroduplex molecules on the basis of changes in
electrophoretic mobility (Keen et al. (1991) Trends Genet 7:5).

h. Polyacrylamide Gel Electrophoresis

In yet another embodiment, the identity of an allelic variant of a
polymorphic region in the COX6B gene, GPI-1 gene or other gene
associated with cardiovascular disease is obtained by analyzing the
movement of a nucleic acid comprising the polymorphic region in
polyacrylamide gels containing a gradient of denaturant, i.e., by
denaturing gradient gel electrophoresis (DGGE) (Myers et al. (1985)
Nature 313:495). When DGGE is used as the method of analysis, DNA
will be modified to ensure that it does not completely denature, for
example by adding a GC clamp of approximately 40 bp of high-melting
GC-rich DNA by PCR. In a further embodiment, a temperature gradient is
used in place of a denaturing agent gradient to identify differences in the
mobility of control and sample DNA (Rosenbaum and Reissner (1987)
Biophys Chem 265:1275).

i. Oligonucleotide ligation assay (OLA)
In another embodiment, identification of the allelic variant is carried
out using an oligonucleotide ligation assay (OLA), as described, e.g., in
U.S. Patent No. 4,998,617 and in Landegren, U. et al., Science
241:1077-1080 (1988). The OLA protocol uses two oligonucleotides that
are designed to be capable of hybridizing to abutting sequences of a
single strand of a target. One of the oligonucleotides is linked to a
separation marker, e.g., biotinylated, and the other is detectably labeled.
If the precise complementary sequence is found in a target molecule, the
oligonucleotides will hybridize such that their termini abut, and create a
ligation substrate. Ligation then permits the labeled oligonucleotide to be
recovered using avidin, or another biotin ligand. Nickerson, D. A. et al.
have described a nucleic acid detection assay that combines attributes of
PCR and OLA (Nickerson, D. A. et al., Proc. Natl. Acad. Sci. (U.S.A.)
87:8923-8927 (1990)). In this method, PCR is used to achieve the
exponential amplification of target DNA, which is then detected using
OLA.
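
The ligation requirement described above can be pictured with a short,
idealized Python sketch. All sequences and names below are hypothetical. The
model assumes that ligation occurs only when the two oligonucleotides, laid
end to end, are perfectly complementary to one contiguous stretch of the
target; real assays depend on ligase fidelity and reaction conditions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def ligatable(target, probe_a, probe_b):
    """Idealized OLA rule: probe_a (3' end) and probe_b (5' end) abut and
    are perfectly complementary to a contiguous stretch of the target."""
    return revcomp(probe_a + probe_b) in target.upper()


# Hypothetical target strands differing by one base (C>T at index 11).
target_ref = "CCATGACTGACCGTTAGCAA"
target_var = "CCATGACTGACTGTTAGCAA"

# Probes designed against the reference; the 3' base of probe_a pairs
# with the polymorphic position.
region = target_ref[4:18]
probe_a, probe_b = revcomp(region)[:7], revcomp(region)[7:]

print(ligatable(target_ref, probe_a, probe_b))  # True
print(ligatable(target_var, probe_a, probe_b))  # False
```

In the assay itself, only the ligated product would carry both the
separation marker and the detectable label.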

Several techniques based on this OLA method have been developed
and can be used to detect specific allelic variants of a polymorphic region
of a gene. For example, U.S. Pat. No. 5,593,826 discloses an OLA using
an oligonucleotide having a 3'-amino group and a 5'-phosphorylated
oligonucleotide to form a conjugate having a phosphoramidate linkage. In
another variation of OLA, described in Tobe et al. ((1996) Nucl. Acids Res.
24:3728), OLA combined with PCR permits typing of two alleles in a
single microtiter well. By marking each of the allele-specific primers with a
unique hapten, i.e., digoxigenin and fluorescein, each OLA reaction can be
detected by using hapten specific antibodies that are labeled with
different enzyme reporters, alkaline phosphatase or horseradish
peroxidase. This system permits the detection of the two alleles using a
high throughput format that leads to the production of two different
colors.

j. SNP detection methods 
Also provided are methods for detecting single nucleotide
polymorphisms. Because single nucleotide polymorphisms constitute sites
of variation flanked by regions of invariant sequence, their analysis
requires no more than the determination of the identity of the single
nucleotide present at the site of variation and it is unnecessary to
determine a complete gene sequence for each patient. Several methods
have been developed to facilitate the analysis of such single nucleotide
polymorphisms.

In one embodiment, the single base polymorphism can be detected
by using a specialized exonuclease-resistant nucleotide, as disclosed,
e.g., in Mundy, C. R. (U.S. Patent No. 4,656,127). According to the
method, a primer complementary to the allelic sequence immediately 3' to
the polymorphic site is permitted to hybridize to a target molecule
obtained from a particular animal or human. If the polymorphic site on the
target molecule contains a nucleotide that is complementary to the
particular exonuclease-resistant nucleotide derivative present, then that
derivative will be incorporated onto the end of the hybridized primer.
Such incorporation renders the primer resistant to exonuclease, and
thereby permits its detection. Since the identity of the
exonuclease-resistant derivative of the sample is known, a finding that
the primer has become resistant to exonucleases reveals that the
nucleotide present in the polymorphic site of the target molecule was
complementary to that of the nucleotide derivative used in the reaction.
This method has the advantage that it does not require the determination
of large amounts of extraneous sequence data.

In another embodiment, a solution-based method for determining
the identity of the nucleotide of a polymorphic site is employed (Cohen,
D. et al. (French Patent 2,650,840; PCT Application No. WO91/02087)).
As in the Mundy method of U.S. Patent No. 4,656,127, a primer is
employed that is complementary to allelic sequences immediately 3' to a
polymorphic site. The method determines the identity of the nucleotide of
that site using labeled dideoxynucleotide derivatives, which, if
complementary to the nucleotide of the polymorphic site, will become
incorporated onto the terminus of the primer.
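
The logic shared by these primer extension methods can be sketched as
follows. This is an illustrative model only: the template, the site position
and the primer length are hypothetical, and the sketch ignores hybridization
conditions and enzyme behavior.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def extension_primer(template, site_index, length=6):
    """Primer complementary to the `length` bases immediately 3' of the site.

    `template` is read 5'->3'; the returned primer is also written 5'->3',
    so its 3' end pairs with the base next to the polymorphic site.
    """
    downstream = template[site_index + 1:site_index + 1 + length]
    return "".join(COMPLEMENT[b] for b in reversed(downstream))


def expected_terminator(template, site_index):
    """Labeled terminator expected at the primer terminus: the complement
    of the nucleotide present at the polymorphic site."""
    return COMPLEMENT[template[site_index]]


template = "ACGTTGCAGGCT"  # hypothetical target strand, 5'->3'
site = 3                   # hypothetical polymorphic position (T here)

print(extension_primer(template, site))     # CCTGCA
print(expected_terminator(template, site))  # A
```

Observing which labeled terminator was incorporated therefore identifies the
nucleotide at the polymorphic site without any further sequence data.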

k. Genetic Bit Analysis

An alternative method, known as Genetic Bit Analysis or GBA™, is
described by Goelet et al. (U.S. Patent No. 6,004,744, PCT Application
No. WO 92/15712). The method of Goelet et al. uses mixtures of labeled
terminators and a primer that is complementary to the sequence 3' to a
polymorphic site. The labeled terminator that is incorporated is thus
determined by, and complementary to, the nucleotide present in the
polymorphic site of the target molecule being evaluated. In contrast to
the method of Cohen et al. (French Patent 2,650,840; PCT Application
No. WO91/02087), the method of Goelet et al. is typically a
heterogeneous phase assay, in which the primer or the target molecule is
immobilized to a solid phase.

l. Other primer-guided nucleotide incorporation procedures

Other primer-guided nucleotide incorporation procedures for
assaying polymorphic sites in DNA have been described (Kornher, J. S. et
al., Nucl. Acids Res. 17:7779-7784 (1989); Sokolov, B. P., Nucl. Acids
Res. 18:3671 (1990); Syvanen, A. C., et al., Genomics 8:684-692
(1990); Kuppuswamy, M. N. et al., Proc. Natl. Acad. Sci. (U.S.A.)
88:1143-1147 (1991); Prezant, T. R. et al., Hum. Mutat. 1:159-164
(1992); Ugozzoli, L. et al., GATA 9:107-112 (1992); Nyren, P. et al.,
Anal. Biochem. 208:171-175 (1993)). These methods differ from GBA™
in that they all rely on the incorporation of labeled deoxynucleotides to
discriminate between bases at a polymorphic site. In such a format, since
the signal is proportional to the number of deoxynucleotides incorporated,
polymorphisms that occur in runs of the same nucleotide can result in
signals that are proportional to the length of the run (Syvanen, A. C., et
al., Amer. J. Hum. Genet. 52:46-59 (1993)).

For determining the identity of the allelic variant of a polymorphic
region located in the coding region of a gene, methods other than
those described above can be used. For example, identification of an
allelic variant that encodes a mutated protein can be performed by using
an antibody specifically recognizing the mutant protein in, e.g.,
immunohistochemistry or immunoprecipitation. Binding assays are
known in the art and involve, e.g., obtaining cells from a subject, and
performing binding experiments with a labeled lipid, to determine whether
binding to the mutated form of the protein differs from binding to the
wild-type protein.

m. Molecular structure determination 

If a polymorphic region is located in an exon, either in a coding or
non-coding region of the gene, the identity of the allelic variant can be
determined by determining the molecular structure of the mRNA,
pre-mRNA, or cDNA. The molecular structure can be determined using
any of the above described methods for determining the molecular
structure of the genomic DNA, e.g., sequencing and SSCP.

n. Mass spectrometry methods

Nucleic acids also can be analyzed by detection methods and
protocols, particularly those that rely on mass spectrometry (see, e.g.,
U.S. Patent No. 5,605,798, allowed co-pending U.S. Application Serial
No. 08/617,256, allowed co-pending U.S. Application Serial No.
08/744,481, U.S. Application Serial No. 08/990,851, International PCT
Application No. WO 98/20019). These methods can be automated (see,
e.g., co-pending U.S. Application Serial No. 09/285,481, which describes
an automated process line). Among the methods of analysis herein are
those involving the primer oligo base extension (PROBE) reaction with
mass spectrometry for detection (described herein and elsewhere, see,
e.g., U.S. Application Serial Nos. 08/617,256, 09/287,681, 09/287,682,
09/287,141 and 09/287,679, allowed co-pending U.S. Application Serial
No. 08/744,481, International PCT Application No. PCT/US97/20444,
published as International PCT Application No. WO 98/20019, and based
upon U.S. Application Serial Nos. 08/744,481, 08/744,590, 08/746,036,
08/746,055, 08/786,988, 08/787,639, 08/933,792, 08/746,055,
08/786,988 and 08/787,639; see, also U.S. Application Serial No.
09/074,936, allowed U.S. Application Serial No. 08/787,639, and U.S.
Application Serial Nos. 08/746,055 and 08/786,988, and published
International PCT Application No. WO 98/20020).

One format for performing the analyses is a chip based format in
which the biopolymer is linked to a solid support, such as a silicon or
silicon-coated substrate, typically in the form of an addressable array.
Typically, when analyses are performed using mass spectrometry,
particularly MALDI, nanoliter volumes of sample are loaded such that
the resulting spot is about, or smaller than, the size of the laser spot. It
has been found that when this is achieved, the results from the mass
spectrometry analysis are quantitative. The areas under the peaks in the
resulting mass spectra are proportional to concentration (when normalized
and corrected for background). Methods for preparing and using such
chips are described in allowed co-pending U.S. Application Serial No.
08/787,639, co-pending U.S. Application Serial Nos. 08/786,988,
09/364,774, 09/371,150 and 09/297,575; see, also International PCT
Application No. PCT/US97/20195, which published as International PCT
Application No. WO 98/20020. Chips and kits for performing these
analyses are commercially available from SEQUENOM under the
trademark MassARRAY™. MassARRAY™ relies on the fidelity of the
enzymatic primer extension reactions combined with the miniaturized
array and MALDI-TOF (Matrix-Assisted Laser Desorption Ionization-Time
of Flight) mass spectrometry to deliver results rapidly. It accurately
distinguishes single base changes in the size of DNA fragments relating to
genetic variants without tags.
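
Because peak areas are described above as proportional to concentration
once normalized and corrected for background, relative allele amounts could
in principle be estimated from them. The following Python sketch shows one
simple way to do so; the peak areas, the single flat background value and the
function name are hypothetical and are not taken from any instrument
software.

```python
def allele_fractions(peak_areas, background=0.0):
    """Normalize background-corrected peak areas to fractions summing to 1.

    peak_areas: mapping of allele label -> integrated peak area (hypothetical)
    background: flat background area subtracted from every peak (assumption)
    """
    corrected = {allele: max(area - background, 0.0)
                 for allele, area in peak_areas.items()}
    total = sum(corrected.values())
    if total == 0:
        raise ValueError("no signal above background")
    return {allele: area / total for allele, area in corrected.items()}


# Hypothetical areas for the extension products of two alleles.
fractions = allele_fractions({"C": 320.0, "T": 120.0}, background=20.0)
print(fractions)  # {'C': 0.75, 'T': 0.25}
```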
Multiplex methods allow for the simultaneous detection of more
than one polymorphic region in a particular gene or polymorphic regions in
several genes. This is one exemplary method for carrying out
haplotype analysis of allelic variants of the COX6B and/or GPI-1 genes
separately, or along with allelic variants of one or more other genes
associated with cardiovascular disease.

Multiplexing can be achieved by several different methodologies.
For example, several mutations can be simultaneously detected on one
target sequence by employing corresponding detector (probe) molecules
(e.g., oligonucleotides or oligonucleotide mimetics). The molecular weight
differences between the detector oligonucleotides must be large enough
so that simultaneous detection (multiplexing) is possible. This can be
achieved either by the sequence itself (composition or length) or by the
introduction of mass-modifying functionalities into the detector
oligonucleotides (see below).
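
The requirement that detector masses be sufficiently separated can be
checked with a few lines of Python. The detector names, masses and the
minimum separation used here are hypothetical placeholders; the separation
actually needed depends on the instrument and is not specified in this text.

```python
from itertools import combinations


def unresolved_pairs(masses, min_delta):
    """Return detector pairs whose masses differ by less than min_delta."""
    return [(a, b)
            for (a, mass_a), (b, mass_b) in combinations(masses.items(), 2)
            if abs(mass_a - mass_b) < min_delta]


# Hypothetical detector oligonucleotide masses (Da).
detectors = {"D1": 5200.0, "D2": 5215.0, "D3": 5500.0}

print(unresolved_pairs(detectors, min_delta=30.0))  # [('D1', 'D2')]
```

A pair reported by such a check would be a candidate for a change in length
or composition, or for a mass-modifying functionality.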

Mass modifying moieties can be attached, for instance, to
the 5'-end of the oligonucleotide, to the nucleobase (or bases), to the
phosphate backbone, to the 2'-position of the nucleoside
(nucleosides) and/or to the terminal 3'-position. Examples of mass
modifying moieties include a halogen, an azido group, or a moiety of the
type XR, wherein X is a linking group and R is a mass-modifying
functionality. The mass-modifying functionality can thus be used to
introduce defined mass increments into the oligonucleotide molecule.
The mass-modifying functionality can be located at different
positions within the nucleotide moiety (see, e.g., U.S. Patent No.
5,547,835 and International PCT Application No. WO 94/21822). For
example, the mass-modifying moiety, M, can be attached either to the
nucleobase (in the case of the c7-deazanucleosides also to C-7), to the
triphosphate group at the alpha phosphate or to the 2'-position of the
sugar ring of the nucleoside triphosphate. Modifications introduced at the
phosphodiester bond, such as with alpha-thio nucleoside triphosphates,
have the advantage that these modifications do not interfere with
accurate Watson-Crick base-pairing and additionally allow for the one-step
post-synthetic site-specific modification of the complete nucleic acid
molecule, e.g., via alkylation reactions (see, e.g., Nakamaye et al. (1988)
Nucl. Acids Res. 16:9947-59). Exemplary mass-modifying functionalities
are boron-modified nucleic acids since they are better incorporated into
nucleic acids by polymerases (see, e.g., Porter et al. (1995) Biochemistry
34:11963-11969; Hasan et al. (1996) Nucleic Acids Res. 24:2150-2157;
Li et al. (1995) Nucl. Acids Res. 23:4495-4501).

Furthermore, the mass-modifying functionality can be added so as
to effect chain termination, such as by attaching it to the 3'-position of
the sugar ring in the nucleoside triphosphate. For those skilled in the art,
it is clear that many combinations can be used in the methods provided
herein. In the same way, those skilled in the art will recognize that
chain-elongating nucleoside triphosphates also can be mass-modified in a
similar fashion with numerous variations and combinations in functionality
and attachment positions.
For example, without being bound to any particular theory, the
mass-modification can be introduced for X in XR as well as using
oligo-/polyethylene glycol derivatives for R. The mass-modifying
increment (m) in this case is 44, i.e., five different mass-modified species
can be generated by just changing m from 0 to 4, thus adding mass units
of 45 (m = 0), 89 (m = 1), 133 (m = 2), 177 (m = 3) and 221 (m = 4) to the
nucleic acid molecule (e.g., detector oligonucleotide (D) or the nucleoside
triphosphates, respectively). The oligo/polyethylene glycols also can be
monoalkylated by a lower alkyl such as, but not limited to, methyl,
ethyl, propyl, isopropyl and t-butyl. Other chemistries can be used in the
mass-modified compounds (see, e.g., those described in Oligonucleotides
and Analogues, A Practical Approach, F. Eckstein, editor, IRL Press,
Oxford, 1991).
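
The series of mass units recited above follows a simple linear rule, which
can be reproduced as a check on the arithmetic. The function name and its
default arguments are illustrative only; the values 45 and 44 are taken
directly from the preceding paragraph.

```python
def peg_mass_units(m, base=45, increment=44):
    """Mass units added for m oligo/polyethylene glycol repeat units."""
    return base + increment * m


print([peg_mass_units(m) for m in range(5)])  # [45, 89, 133, 177, 221]
```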

In yet another embodiment, various mass-modifying functionalities,
R, other than oligo/polyethylene glycols, can be selected and attached via
appropriate linking chemistries, X. A simple mass-modification can be
achieved by substituting halogens, such as F, Cl, Br and/or I, or
pseudohalogens such as CN, SCN, NCS, for H, or by using different alkyl,
aryl or aralkyl moieties such as methyl, ethyl, propyl, isopropyl, t-butyl,
hexyl, phenyl, substituted phenyl, benzyl, or functional groups such as
CH2F, CHF2, CF3, Si(CH3)3, Si(CH3)2(C2H5), Si(CH3)(C2H5)2, Si(C2H5)3. Yet
another mass-modification can be obtained by attaching homo- or
heteropeptides through the nucleic acid molecule (e.g., detector (D)) or
nucleoside triphosphates. One example, useful in generating
mass-modified species with a mass increment of 57, is the attachment of
oligoglycines (m) to nucleic acid molecules (r), e.g., mass-modifications of
74 (r = 1, m = 0), 131 (r = 1, m = 1), 188 (r = 1, m = 2), 245 (r = 1, m = 3)
are achieved. Simple oligoamides also can be used, e.g., but not limited
to, mass-modifications of 74 (r = 1, m = 0), 88 (r = 2, m = 0), 102 (r = 3,
m = 0), 116 (r = 4, m = 0), are obtainable. Variations in addition to those
set forth herein will be apparent to the skilled artisan.
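
The two series recited above can likewise be reproduced arithmetically. In
the sketch below, the increment of 57 per glycine unit is stated in the text;
the increment of 14 per step in r is inferred from the listed values (74, 88,
102, 116) and is labeled as such. The function names are hypothetical.

```python
def oligoglycine_mass(m, base=74, increment=57):
    """Mass modification for r = 1 with m additional glycine units."""
    return base + increment * m


def oligoamide_mass(r, base=74, increment=14):
    """Mass modification for m = 0 as r increases from 1.

    The step of 14 is inferred from the values listed in the text.
    """
    return base + increment * (r - 1)


print([oligoglycine_mass(m) for m in range(4)])   # [74, 131, 188, 245]
print([oligoamide_mass(r) for r in range(1, 5)])  # [74, 88, 102, 116]
```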

Different mass-modified detector oligonucleotides can be used to
detect all possible variants/mutants simultaneously.
Alternatively, all four base permutations at the site of a mutation can be
detected by designing and positioning a detector oligonucleotide, so that
it serves as a primer for a DNA/RNA polymerase with varying
combinations of elongating and terminating nucleoside triphosphates. For
example, mass modifications also can be incorporated during the
amplification process.
A different multiplex detection format is one in which differentiation
is accomplished by employing different specific capture sequences that
are position-specifically immobilized on a flat surface (e.g., a 'chip array').
If different target sequences T1-Tn are present, their target capture sites
TCS1-TCSn will specifically interact with complementary immobilized
capture sequences C1-Cn. Detection is achieved by employing
appropriately mass differentiated detector oligonucleotides D1-Dn, which
carry mass modifying functionalities M1-Mn.
o. Other methods 
Additional methods of analyzing nucleic acids include
amplification-based methods including polymerase chain reaction (PCR),
ligase chain reaction (LCR), mini-PCR, rolling circle amplification,
autocatalytic methods, such as those using Qβ replicase, TAS, 3SR, and
any other suitable method known to those of skill in the art.

Other methods for analysis and identification and detection of
polymorphisms include, but are not limited to, allele specific probes,
Southern analyses, and other such analyses.
2. Primers and probes 
Primers refer to nucleic acids that are capable of specifically
hybridizing to a nucleic acid sequence that is adjacent to a polymorphic
region of interest or to a polymorphic region and are extended. A primer
can be used alone in a detection method, or a primer can be used
together with at least one other primer or probe in a detection method.
Primers also can be used to amplify at least a portion of a nucleic acid.
For amplifying at least a portion of a nucleic acid, a forward primer (i.e.,
5' primer) and a reverse primer (i.e., 3' primer) typically will be used.
Forward and reverse primers hybridize to complementary strands of a
double stranded nucleic acid, such that upon extension from each primer,
a double stranded nucleic acid is amplified.
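
The relationship between a forward primer, a reverse primer and the
amplified region can be illustrated with a short Python sketch. The template
sequence, the coordinates and the six-base primer length are hypothetical
and far shorter than would be used in practice; the sketch only shows that
the two primers match opposite strands and bracket the amplified segment.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def primer_pair(top_strand, start, end, length=6):
    """Forward primer copies the 5' end of the region on the top strand;
    reverse primer is the reverse complement of the region's 3' end."""
    region = top_strand[start:end]
    return region[:length], reverse_complement(region[-length:])


def amplicon(top_strand, forward, reverse):
    """Segment of the top strand bracketed by the two primers, or None."""
    i = top_strand.find(forward)
    j = top_strand.find(reverse_complement(reverse))
    if i < 0 or j < i:
        return None
    return top_strand[i:j + len(reverse)]


template = "TTACGGATCCAGTCATGCAAGGCTAGG"  # hypothetical top strand, 5'->3'
fwd, rev = primer_pair(template, 2, 25)

print(fwd, rev)                      # ACGGAT TAGCCT
print(amplicon(template, fwd, rev))  # ACGGATCCAGTCATGCAAGGCTA
```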

Probes refer to nucleic acids that hybridize to the region of interest
and that are not further extended. For example, a probe is a nucleic acid
that hybridizes adjacent to or at a polymorphic region of a COX6B gene, a
GPI-1 gene or another gene associated with cardiovascular disease and
that by hybridization or absence of hybridization to the DNA of a subject
will be indicative of the identity of the allelic variant of the polymorphic
region of the gene. Exemplary probes have a number of nucleotides
sufficient to allow specific hybridization to the target nucleotide
sequence. Where the target nucleotide sequence is present in a large
fragment of DNA, such as a genomic DNA fragment of several tens or
hundreds of kilobases, the probe may have to be longer to
provide sufficiently specific hybridization, as compared to a probe that is
used to detect a target sequence that is present in a shorter fragment of
DNA. For example, in some diagnostic methods, a portion of a COX6B
gene, a GPI-1 gene or another gene associated with cardiovascular
disease may first be amplified and thus isolated from the rest of the
chromosomal DNA and then hybridized to a probe. In such a situation, a
shorter probe will likely provide sufficient specificity of hybridization. For
example, a probe having a nucleotide sequence of about 10 nucleotides
may be sufficient.

Exemplary primers and probes hybridize adjacent to or at the
polymorphic sites described in TABLES 1-3. In addition, primers include
SEQ ID NOS.: 5, 10, 43, 48, 53, 58, 63, 68, 73, 78, 83, 88, 93, 98,
103, 108, 113, and 118.

Primers and probes (RNA, DNA (single-stranded or double-stranded),
PNA and their analogs) described herein may be labeled with
any detectable reporter or signal moiety including, but not limited to,
radioisotopes, enzymes, antigens, antibodies, spectrophotometry
reagents, chemiluminescent reagents, fluorescent and any other light
producing chemicals. Additionally, these probes may be modified without
changing the substance of their purpose by terminal addition of
nucleotides designed to incorporate restriction sites or other useful
sequences, proteins, signal generating ligands such as acridinium esters,
and/or paramagnetic particles.

These probes may also be modified by the addition of a capture
moiety (including, but not limited to, paramagnetic particles, biotin,
fluorescein, digoxigenin, antigens, antibodies) or attached to the walls of
microtiter trays to assist in the solid phase capture and purification of
these probes and any DNA or RNA hybridized to these probes.
Fluorescein may be used as a signal moiety as well as a capture moiety,
the latter by interacting with an anti-fluorescein antibody.

Any probe or primer can be prepared according to methods well
known in the art and described, e.g., in Sambrook, J., Fritsch, E.F., and
Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, N.Y. For example, discrete
fragments of the DNA can be prepared and cloned using restriction
enzymes. Alternatively, probes and primers can be prepared using the
Polymerase Chain Reaction (PCR) using primers having an appropriate
sequence.

Oligonucleotides may be synthesized by standard methods known
in the art, e.g., by use of an automated DNA synthesizer (such as are
commercially available from numerous sources, such as Biosearch
(Novato, CA) and Applied Biosystems (Foster City, CA)). As examples,
phosphorothioate oligonucleotides may be synthesized by the method of
Stein et al. ((1988) Nucl. Acids Res. 16:3209), methylphosphonate
oligonucleotides can be prepared by use of controlled pore glass polymer
supports (Sarin et al., 1988, Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451),
and others.

H. Transgenic Animals 

Methods for making transgenic animals using a variety of
transgenes are known (see, e.g., Wagner et al. (1981) Proc. Nat. Acad.
Sci. U.S.A. 78:5016; Stewart et al. (1982) Science 217:1046;
Costantini et al. (1981) Nature 294:92; Lacy et al. (1982) Cell 34:343;
McKnight et al. (1983) Cell 34:335; Brinster et al. (1983) Nature
306:332; Palmiter et al. (1982) Nature 300:611; Palmiter et al. (1982)
Cell 29:701 and Palmiter et al. (1983) Science 222:809; and U.S. Patent
Nos. 6,175,057; 6,180,849 and 6,133,502).

Transgenic animals contain an exogenous nucleic acid sequence
present as an extrachromosomal element or stably integrated in all or a
portion of their cells, especially germ cells. Unless otherwise indicated, it
will be assumed that a transgenic animal contains stable changes to the
germline sequence. During the initial construction of the animal,
"chimeras" or "chimeric animals" are generated, in which only a subset of
cells have the altered genome. Chimeras are primarily used for breeding
purposes in order to generate the desired transgenic animal. Animals
having a heterozygous alteration are generated by breeding of chimeras.
Male and female heterozygotes are typically bred to generate homozygous
animals.
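
The expected outcome of breeding two heterozygotes can be tabulated with a
simple Punnett-square sketch, assuming ordinary Mendelian segregation of a
single altered locus. The genotype notation ("T" for the altered allele, "t"
for the unaltered allele) is hypothetical and used only for illustration.

```python
from collections import Counter
from itertools import product


def cross(parent_1, parent_2):
    """Count offspring genotypes for two parents given as allele pairs."""
    offspring = Counter()
    for allele_1, allele_2 in product(parent_1, parent_2):
        # Sort so that "tT" and "Tt" are counted as the same genotype.
        offspring["".join(sorted(allele_1 + allele_2))] += 1
    return offspring


print(cross("Tt", "Tt"))  # Counter({'Tt': 2, 'TT': 1, 'tt': 1})
```

Under these assumptions about one quarter of the offspring of a heterozygote
cross would be expected to be homozygous for the alteration.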

The exogenous gene is usually either from a different species than
the animal host, or is otherwise altered in its coding or non-coding
sequence. The introduced gene may be a wild-type gene, a naturally
occurring polymorphism (e.g., as described for COX6B, GPI-1 and other
genes associated with cardiovascular disease) or a genetically
manipulated sequence, for example having deletions, substitutions or
insertions in the coding or non-coding regions. When the introduced gene
is a coding sequence, it is usually operably linked to a promoter, which
may be constitutive or inducible, and other regulatory sequences required
for expression in the host animal.

Transgenic animals can contain other genetic alterations in addition
to the presence of alleles of COX6B and/or GPI-1 genes. For example,
the genome can be altered to affect the function of the endogenous
genes, contain marker genes, or contain other genetic alterations (e.g.,
alleles of other genes associated with cardiovascular disease).

A "knock-out" of a gene means an alteration in the sequence of the
gene that results in a decrease of function of the target gene, typically
such that target gene expression is undetectable or insignificant. A
knock-out of an endogenous COX6B or GPI-1 gene means that function
of the gene has been substantially decreased so that expression is not
detectable or only present at insignificant levels. "Knock-out" transgenics
can be transgenic animals having a heterozygous knock-out of the COX6B
or GPI-1 gene or a homozygous knock-out of one or both of these genes.
"Knock-outs" also include conditional knock-outs, where alteration of the
target gene can occur upon, for example, exposure of the animal to a
substance that promotes target gene alteration, introduction of an enzyme
that promotes recombination at the target gene site (e.g., Cre in the
Cre-lox system), or other method for directing the target gene alteration
postnatally.

A "knock-in" of a target gene means an alteration in a host cell
genome that results in altered expression (e.g., increased (including
ectopic) expression) of the target gene, e.g., by introduction of an
additional copy of the target gene, or by operatively inserting a
regulatory sequence that provides for enhanced expression of an
endogenous copy of the target gene. "Knock-in" transgenics of interest
can be transgenic animals having a knock-in of the COX6B or GPI-1 gene.
Such transgenics can be heterozygous or homozygous for the knock-in
gene. "Knock-ins" also encompass conditional knock-ins.

A construct is suitable for use in the generation of transgenic 
animals if it allows the desired level of expression of a COX6B or GPI-1 
encoding sequence or the encoding sequence of another gene associated 
with cardiovascular disease. Methods of isolating and cloning a desired
sequence, as well as suitable constructs for expression of a selected
sequence in a host animal, are well known in the art and are described 
below. 

For the introduction of a gene into the subject animal, it is generally
advantageous to use the gene as a gene construct wherein the gene is
ligated downstream of, and operably linked to, a promoter capable of
expressing the gene in the subject animal cells. Specifically, a transgenic
non-human mammal showing high expression of the desired gene can be
created by microinjecting, into a fertilized egg of the subject non-human
mammal (e.g., rat fertilized egg), a vector in which said gene is ligated
downstream of any of various promoters capable of expressing the protein
and/or the corresponding protein derived from various mammals (rabbits,
dogs, cats, guinea pigs, hamsters, rats, mice and other mammals).

Useful vectors include Escherichia coli-derived plasmids, Bacillus
subtilis-derived plasmids, yeast-derived plasmids, bacteriophages such as
lambda phage, retroviruses such as Moloney leukemia virus, and animal
viruses such as vaccinia virus or baculovirus.

Useful promoters for such gene expression regulation include, for
example, promoters for genes derived from viruses (cytomegalovirus,
Moloney leukemia virus, JC virus, breast cancer virus and others), and
promoters for genes derived from various mammals (humans, rabbits,
dogs, cats, guinea pigs, hamsters, rats, mice and other such mammalian
species) and birds, such as, but not limited to, chickens (e.g., genes
for albumin, insulin II, erythropoietin, endothelin, osteocalcin, muscular
creatine kinase, platelet-derived growth factor beta, keratins K1, K10 and
K14, collagen types I and II, atrial natriuretic factor, dopamine
beta-hydroxylase, endothelial receptor tyrosine kinase (generally
abbreviated Tie2), sodium-potassium adenosine triphosphatase (generally
abbreviated Na,K-ATPase), neurofilament light chain, metallothioneins I
and IIA, metalloproteinase I tissue inhibitor, MHC class I antigen
(generally abbreviated H-2L), smooth muscle alpha actin, polypeptide
chain elongation factor 1 alpha (EF-1 alpha), beta actin, alpha and beta
myosin heavy chains, myosin light chains 1 and 2, myelin basic protein,
serum amyloid component, myoglobin, renin and other such proteins).

The above-mentioned vectors can include a sequence for
terminating the transcription of the desired messenger RNA in the
transgenic animal (generally referred to as a terminator); for example, gene
expression can be manipulated using a sequence with such function
contained in various genes derived from viruses, mammals and birds. The
simian virus SV40 terminator is a commonly used exemplary terminator.
Additionally, for the purpose of increasing the expression of the desired
gene, the splicing signal and enhancer region of each gene, or a portion of
the intron of a eukaryotic organism gene, may be ligated 5' upstream of
the promoter region, or between the promoter region and the translational
region, or 3' downstream of the translational region as desired.

A translational region for a protein of interest can be obtained using 
the entire or portion of genomic DNA of blood, kidney or fibroblast origin 
from various mammals (humans, rabbits, dogs, cats, guinea pigs,
hamsters, rats, mice and others) or of various commercially available
genomic DNA libraries, as a starting material, or using complementary
DNA prepared by a known method from RNA of blood, kidney or
fibroblast origin as a starting material. Also, an exogenous gene can be
obtained using complementary DNA prepared by a known method from
RNA of human fibroblast origin as a starting material. All these
translational regions can be used in transgenic animals. 

To obtain the translational region, it is possible to prepare DNA 
incorporating an exogenous gene encoding the protein of interest in which 
the gene is ligated downstream of the above-mentioned promoter
(generally upstream of the translation termination site) as a gene
construct capable of being expressed in the transgenic animal. 

DNA constructs for random integration need not include regions of 
homology to mediate recombination. Where homologous recombination is 
desired, the DNA constructs contain at least a portion of the target gene
with the desired genetic modification, and include regions of homology to
the target locus. Conveniently, markers for positive and negative 
selection are included. Methods for generating cells having targeted gene 
modifications through homologous recombination are known in the art.
For various techniques for transfecting mammalian cells, see Keown et al.
(1990) Methods in Enzymology 185:527-537.

The transgenic animal can be created by introducing a COX6B or
GPI-1 gene construct into, for example, an unfertilized egg, a fertilized
egg, a spermatozoon or a germinal cell containing a primordial germinal
cell thereof, generally in the embryogenic stage in the development of a
non-human mammal (typically in the single-cell or fertilized cell stage and
generally before the 8-cell phase), by standard means, such as the
calcium phosphate method, the electric pulse method, the lipofection
method, the agglutination method, the microinjection method, the particle
gun method, the DEAE-dextran method and other such methods. Also, it
is possible to introduce a desired COX6B or GPI-1 gene into, for
example, a somatic cell, a living organ or a tissue cell by gene
transformation methods, and use it for cell culture, tissue culture and
other such uses. Furthermore, these cells may be fused with the
above-described germinal cell by a commonly known cell fusion method to
create a transgenic animal.

For embryonic stem (ES) cells, an ES cell line may be employed, or
embryonic cells may be obtained freshly from a host, e.g., mouse, rat,
guinea pig, and other mammals and birds. Such cells are grown on an
appropriate fibroblast-feeder layer or grown in the presence of appropriate
growth factors, such as leukemia inhibitory factor (LIF). When ES cells
have been transformed, they may be used to produce transgenic animals.
After transformation, the cells are plated onto a feeder layer in an
appropriate medium. Cells containing the construct may be detected by
employing a selective medium. After sufficient time for colonies to grow,
they are picked and analyzed for the occurrence of homologous
recombination or integration of the construct. Those colonies that are
positive may then be used for embryo manipulation and blastocyst
injection. Blastocysts are obtained from 4 to 6 week old superovulated
females. The ES cells are trypsinized, and the modified cells are injected
into the blastocoel of the blastocyst. After injection, the blastocysts are
returned to each uterine horn of pseudopregnant females. Females are
then allowed to go to term and the resulting litters screened for mutant
cells having the construct. By providing for a different phenotype of the
blastocyst and the ES cells, chimeric progeny can be readily detected.
The chimeric animals are screened for the presence of the modified gene
and males and females having the modification are mated to produce
homozygous progeny. If the gene alterations cause lethality at some
point in development, tissues or organs can be maintained as allogeneic
or congenic grafts or transplants, or in in vitro culture.

Animals containing more than one transgene, such as allelic
variants of COX6B and/or GPI-1 and/or other genes associated with
cardiovascular disease, can be made by sequentially introducing individual
alleles into an animal in order to produce the desired phenotype
(manifestation of or predisposition to cardiovascular disease).

I. Effect of Allelic Variants on the Encoded Protein and Disease 
Related Phenotype 

The effect of an allelic variant on a COX6B or GPI-1 protein (altered
amount, stability, location and/or activity) can be determined according to
methods known in the art. Allelic variants of the COX6B and GPI-1 genes 
can be assayed individually or in combination with other variants known 
to be associated with cardiovascular disease. 

If the mutation is located in an intron, the effect of the mutation can be
determined, e.g., by producing transgenic animals in which the allelic 
variant linked to lipid metabolism and/or cardiovascular disease has been 
introduced and in which the wild-type gene or predominant allele may 
have been knocked out. Comparison of the level of expression of the
protein in the mice transgenic for the allelic variant with mice transgenic
for the predominant allele will reveal whether the mutation results in
increased or decreased synthesis of the associated protein and/or aberrant
tissue distribution of the associated protein. Such analysis could also be
performed in cultured cells, in which the human variant allele gene is
introduced and, e.g., replaces the endogenous gene in the cell. Thus,
depending on the effect of the alteration a specific treatment can be 
administered to a subject having such a mutation. Accordingly, if the 
mutation results in decreased production of a COX6B or GPI-1 protein,
the subject can be treated by administration of a compound that increases
synthesis, such as by increasing COX6B or GPI-1 gene expression, and
wherein the compound acts at a regulatory element different from the one
that is mutated. Alternatively, if the mutation results in increased COX6B
or GPI-1 protein levels, the subject can be treated by administration of a
compound that reduces protein production, e.g., by reducing COX6B or
GPI-1 gene expression or a compound that inhibits or reduces the activity 
of COX6B or GPI-1 protein. 
J. Diagnostic and Prognostic Assays 

Typically, an individual allelic variant that associates with a risk
factor for cardiovascular disease will not be used in isolation as a
prognosticator for a subject developing high cholesterol, low HDL or
cardiovascular disease. An allelic variant typically will be one of a
plurality of indicators that are used. The other indicators may be the
manifestation of other risk factors for cardiovascular disease, e.g., family
history, high blood pressure, weight, activity level and other indicators, or
additional allelic variants in the same or other genes associated with 
altered lipid metabolism and/or cardiovascular disease. 

Useful combinations of allelic variants of the COX6B gene and/or 
the GPI-1 gene can be determined by examining combinations of variants
of these genes, which are assayed individually or assayed simultaneously
using multiplexing methods as described above or any other labelling
method that allows different variants to be identified. In particular,
variants of the COX6B gene and/or the GPI-1 gene may be assayed using kits
(see below) or any of a variety of microarrays known to those in the art.
For example, oligonucleotide probes comprising the polymorphic regions
surrounding any polymorphism in the COX6B or GPI-1 gene may be
designed and fabricated using methods such as those described in U.S.
Patent Nos. 5,492,806; 5,525,464; 5,695,940; 6,018,041; 6,025,136;
WO 98/30883; WO 98/56954; WO 99/09218; WO 00/58516;
WO 00/58519, or references cited therein. Similarly, one of skill in the art
can determine useful combinations of allelic variants of the COX6B and/or
GPI-1 genes along with variants of other genes associated with
cardiovascular disease.

K. Pharmacogenomics

Subjects having one or more different allelic variants of the COX6B
or GPI-1 polymorphic regions will respond differently to therapeutic drugs
to treat cardiovascular disease or conditions. For example, there are
numerous drugs available for lowering cholesterol levels, including
lovastatin (MEVACOR; Merck & Co.), simvastatin (ZOCOR; Merck & Co.),
dextrothyroxine (CHOLOXIN; Knoll Pharmaceutical Co.), pamaqueside
(Pfizer), cholestyramine (QUESTRAN; Bristol-Myers Squibb), colestipol
(COLESTID; Pharmacia & Upjohn), acipimox (Pharmacia & Upjohn),
fenofibrate (LIPIDIL), gemfibrozil (LOPID; Warner-Lambert), cerivastatin
(LIPOBAY; Bayer), fluvastatin (LESCOL; Novartis), atorvastatin (LIPITOR;
Warner-Lambert), etofylline clofibrate (DUOLIP; Merckle (Germany)),
probucol (LORELCO; Hoechst Marion Roussel), omacor (Pronova
(Norway)), etofibrate (Merz (Germany)), clofibrate (ATROMID-S;
Wyeth-Ayerst (AHP)), and niacin (numerous manufacturers). Not all patients
respond identically to these drugs. Alleles of the COX6B or the GPI-1
gene that associate with altered lipid metabolism will be useful alone or in 
conjunction with markers in other genes associated with the development 
of cardiovascular disease to predict a subject's response to a therapeutic
drug. For example, multiplex primer extension assays or microarrays
comprising probes for alleles are useful formats for determining drug
response. A correlation between drug responses and specific alleles or
combinations of alleles of the COX6B or GPI-1 genes and other genes
associated with cardiovascular disease can be shown, for example, by
clinical studies wherein the responses to specific drugs of subjects having
different allelic variants of polymorphic regions of the COX6B or GPI-1
genes alone or in combination with allelic variants of other genes are
compared. Such studies also can be performed using animal models,
such as mice having various alleles and in which, e.g., the endogenous
COX6B or GPI-1 genes have been inactivated such as by a knock-out
mutation. Test drugs are then administered to the mice having different
alleles and the response of the different mice to a specific compound is
compared. Accordingly, assays, microarrays and kits are provided for
determining the drug that will be best suited for treating a specific disease
or condition in a subject based on the individual's genotype. For
example, it will be possible to select drugs that will be devoid of toxicity,
or have the lowest level of toxicity possible for treating a subject having a
disease or condition, e.g., cardiovascular disease or high cholesterol or
low HDL.

L. Kits

Kits can be used to indicate whether a subject is at risk of 
developing high cholesterol, low HDL and/or cardiovascular disease. The 
kits also can be used to determine if a subject who has high cholesterol or 
low HDL carries associated variants in the COX6B or GPI-1 genes or other
cardiovascular disease-related genes. This information could be used,
e.g., to optimize treatment of such individuals as a particular genotype
may be associated with drug response.

In certain embodiments, the kits include a probe or primer that is capable
of hybridizing adjacent to or at a polymorphic region of a COX6B or GPI-1
gene and thereby identifying whether the COX6B or GPI-1 gene contains
an allelic variant that is associated with cardiovascular disease. Primers
or probes that specifically hybridize at or adjacent to the SNPs described
in Tables 1-3 could be included. In particular, primers or probes that
contain the sequences of SEQ ID NOs.: 5, 10, 43, 48, 53, 58, 63, 68,
73, 78, 83, 88, 93, 98, 103, 108, 113, and 118 could be included in the
kits. The kits optionally also include instructions for use in carrying out
assays, interpreting results and diagnosing a subject as having a
predisposition toward developing high cholesterol, low HDL and/or
cardiovascular disease.

Exemplary kits for amplifying a region of a COX6B gene, GPI-1
gene, or other genes associated with cardiovascular disease (such as
those listed in Table 3) contain two primers that flank a polymorphic
region of the gene of interest. For example, primers can include the
sequences of SEQ ID NOs.: 3, 4, 8, 9, 41, 42, 46, 47, 51, 52, 56, 57,
61, 62, 66, 67, 71, 72, 76, 77, 81, 82, 86, 87, 91, 92, 96, 97, 101,
102, 106, 107, 111, 112, 116, and 117. For other assays, primers or
probes hybridize to a polymorphic region or 5' or 3' to a polymorphic
region depending on which strand of the target nucleic acid is used. For
example, specific probes and primers contain sequences designated as
SEQ ID NOs: 5, 10, 43, 48, 53, 58, 63, 68, 73, 78, 83, 88, 93, 98,
103, 108, 113, and 118. Those of skill in the art can synthesize primers
and probes that hybridize adjacent to or at the polymorphic regions
described in TABLES 1-3 and other SNPs in genes associated with
cardiovascular disease.

Yet other kits contain at least one reagent necessary to perform an
assay. For example, the kit can comprise an enzyme, such as a nucleic
acid polymerase. Alternatively, the kit can contain a buffer or any other
necessary reagent.

Yet other kits contain microarrays of probes to detect allelic
variants of COX6B, GPI-1, and other genes associated with
cardiovascular disease. The kits further contain instructions for their use
and interpreting the results.

The following examples are included for illustrative purposes only
and are not intended to limit the scope of the invention. The practice of
methods and development of the products provided herein employ, unless
otherwise indicated, conventional techniques of cell biology, cell culture,
molecular biology, transgenic biology, microbiology, recombinant DNA,
and immunology, which are within the skill of the art. Such techniques
are explained fully in the literature. See, for example, Molecular Cloning:
A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis
(Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I
and II (D.N. Glover ed., 1985); Oligonucleotide Synthesis (M.J. Gait ed.,
1984); Mullis et al. U.S. Patent No. 4,683,195; Nucleic Acid
Hybridization (B.D. Hames & S.J. Higgins eds. 1984); Transcription and
Translation (B.D. Hames & S.J. Higgins eds. 1984); Culture of Animal
Cells (R.I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells and
Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide To Molecular
Cloning (1984); the treatise, Methods In Enzymology (Academic Press,
Inc., New York); Gene Transfer Vectors For Mammalian Cells (J.H. Miller
and M.P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In
Enzymology, Vols. 154 and 155 (Wu et al., eds.); Immunochemical
Methods In Cell and Molecular Biology (Mayer and Walker, eds.,
Academic Press, London, 1987); Handbook of Experimental Immunology,
Volumes I-IV (D.M. Weir and C.C. Blackwell, eds., 1986); Manipulating
the Mouse Embryo (Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, N.Y., 1986).


EXAMPLE 1

Isolation of DNA from blood samples of a stratified population

Blood samples were obtained from a population of unrelated
Caucasian women between the ages of 18-79 (average age = 48). The
women had, in response to media campaigns, attended the Twin
Research Unit at the St. Thomas Hospital in London, England. For current
purposes, only one member of a twin pair was used to ensure that all
observations were independent. Blood samples from 1400 unrelated
individuals were measured for levels of cholesterol and HDL.

Cholesterol and HDL levels in blood samples were quantitated using
standard assay methods.

The population was stratified into pools of 200 people, which
represented the lower extreme and the upper extreme for serum levels of
cholesterol and HDL.

Cholesterol

Pool 1: Individuals were considered to have low
        cholesterol (0.12 - 3.6 mmoles/L).
Pool 2: Individuals were considered to have high
        cholesterol (5.25 - 11.57 mmoles/L).

HDL

Pool 3: Individuals were considered to have low levels
        of HDL (0.240 - 1.11 mmoles/L).
Pool 4: Individuals were considered to have high levels
        of HDL (2.10 - 3.76 mmoles/L).
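
The stratification into extreme pools can be illustrated with a short sketch. This is only an illustration under stated assumptions: the function name `extreme_pools`, the subject identifiers and the synthetic serum values are hypothetical and are not taken from the study; only the pool size of 200 and the population size of 1400 come from the text.

```python
from typing import Dict, List, Tuple


def extreme_pools(levels: Dict[str, float],
                  pool_size: int = 200) -> Tuple[List[str], List[str]]:
    """Return (low_pool, high_pool): the subjects with the lowest and the
    highest measured serum level. Subject IDs are hypothetical labels."""
    if 2 * pool_size > len(levels):
        raise ValueError("population too small: pools would overlap")
    ranked = sorted(levels, key=levels.get)  # ascending by serum level
    return ranked[:pool_size], ranked[-pool_size:]


# Synthetic values for illustration only (not study data): 1400 subjects.
levels = {f"S{i:04d}": 0.12 + i * 0.008 for i in range(1400)}
low_pool, high_pool = extreme_pools(levels)
```

The same function would be applied once to the cholesterol measurements (pools 1 and 2) and once to the HDL measurements (pools 3 and 4).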

DNA extraction protocol 

DNA was extracted from blood samples of each of the pools by 
utilizing the following protocol. 
Section 1

1. Blood was extracted into EDTA tubes.

2. Blood sample was spun at 3,000 rpm for 10 minutes in a 
clinical centrifuge. 

3. The buffy coat (the leucocytes, a yellowish layer of cells on 
top of the red blood cells) was removed and pooled into a 1 

15 ml conical tube. 

4. 0.9% saline was added to fill the tube and resuspend the
leucocytes. Samples were either processed further immediately or
stored at 4°C for 24 hrs.

5. The sample was spun at 2,500 rpm for 10 minutes.

6. The buffy coat was again removed as cleanly as possible,
leaving behind any red cells; the sample was suspended in
red cell lysis buffer and left for 20 minutes at 4°C.

7. The sample was spun again at 2,500 rpm for 10 minutes. If
a pellet of unlysed red cells remained lying above the
leucocytes, the treatment with red cell lysis buffer was
repeated.

8. The leucocyte pellet was resuspended in 2 ml 0.9% saline. 

9. The DNA was liberated by the addition of leucocyte lysis
buffer; the tube was capped and gently inverted several
times, until the liquid became viscous with DNA. The
samples were handled with care to avoid shearing and
damage to the DNA.

10. Samples were frozen for storage prior to full extraction.

Section 2

11. 2 ml of 5 M sodium perchlorate was added to the thawed 
sample and mixed by inversion. The sample was heated to 
60°C for 30 - 40 minutes to fully denature proteins. 

12. An equal volume of chloroform/isoamyl alcohol (24:1) was
added at room temperature and the sample mixed for 10
minutes.

13. The sample was spun without a break at 3,000 rpm for 10
minutes.

14. The top aqueous phase was removed into a clean tube and
two volumes of cold 100% ethanol were added and mixed by
inversion to precipitate DNA.

15. The DNA was removed using a sterile loop and resuspended
in 1-5 ml TE buffer depending on the DNA yield.

16. The optical density was measured at 260 and 280 nm to
check yield and purity of the DNA sample. For use in
Examples 2 and 3, all DNA had an absorbance ratio of 1.6 at
260/280, a total yield of 32 µg and a concentration of 10
ng/µl. If initial purity levels were unacceptable, a
re-extraction was carried out (steps 12-15 above).
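
The yield and purity check in step 16 can be sketched as follows. This is a minimal illustration, not the protocol's own calculation: it assumes the commonly used conversion of 50 ng/µl of double-stranded DNA per A260 unit at a 1 cm path length, and the absorbance readings and the 3.2 ml volume are hypothetical values chosen to reproduce the figures stated above (ratio 1.6, 10 ng/µl, 32 µg).

```python
NG_PER_UL_PER_A260 = 50.0  # assumed conversion factor for dsDNA, 1 cm path


def concentration_ng_per_ul(a260: float, dilution_factor: float = 1.0) -> float:
    """Estimate the dsDNA concentration from the absorbance at 260 nm."""
    return a260 * NG_PER_UL_PER_A260 * dilution_factor


def purity_ratio(a260: float, a280: float) -> float:
    """A260/A280 ratio used as a purity indicator."""
    return a260 / a280


def total_yield_ug(conc_ng_per_ul: float, volume_ml: float) -> float:
    """ng/µl multiplied by ml gives µg (the two factors of 1000 cancel)."""
    return conc_ng_per_ul * volume_ml


# Hypothetical readings for illustration only.
a260, a280 = 0.2, 0.125
conc = concentration_ng_per_ul(a260)   # 10 ng/µl
ratio = purity_ratio(a260, a280)       # 1.6
yield_ug = total_yield_ug(conc, 3.2)   # 32 µg in an assumed 3.2 ml of TE
```

A resuspension volume of 3.2 ml falls within the 1-5 ml range given in step 15.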
EXAMPLE 2

Detection of an Association Between an SNP at Position 86 of the Human 
COX6B Gene and High Cholesterol 

DNA samples (as prepared in Example 1), representing 200
women, from the lower extreme, pool 1 (low levels of cholesterol) and
the upper extreme, pool 2 (high levels of cholesterol) were amplified and
analyzed for genetic differences using a MassEXTEND™ assay detection
method. For each pool, single nucleotide polymorphisms were examined
throughout the entire genome to detect differences in allelic frequency of
a variant allele between the pools.

PCR Amplification of Samples from Pools 1 and 2

PCR primers were synthesized by Operon (Alameda, CA) using
phosphoramidite chemistry. Amplification of the COX6B target sequence
was carried out in two 50 µl PCR reactions with 100 ng of pooled human
genomic DNA, obtained as described in Example 1, taken from samples in
pool 1 or pool 2, although amounts ranging from 100 ng to 1 µg could be
used. Individual DNAs within the pooled samples were present in equal
amounts, each contributing 0.5 ng. Each reaction contained 1X PCR buffer
(Qiagen, Valencia, CA), 200 µM dNTPs, 1U Hotstar Taq polymerase
(Qiagen, Valencia, CA), 4 mM MgCl2, and 25 pmoles of the long primer
containing both the universal primer sequence and the target specific
sequence
5'-AGCGGATAACAATTTCACACAGGTAGTCTGGTTCTGGTTGGGG-3'
(SEQ ID NO.: 4), 2 pmoles of the short primer
5'-AGGATTCAGCACCATGGC-3' (SEQ ID NO.: 3) and 10 pmoles of a
biotinylated universal primer complementary to the 5' end of the PCR
amplicon 5'-AGCGGATAACAATTTCACACAGG-3' (SEQ ID NO.: 121).
Alternatively, the biotinylated universal primer could be
5'-GGCGCACGCCTCCACG-3' (SEQ ID NO.: 122). After an initial round of
amplification of the target with the specific forward (long) and reverse
(short) primers, the 5' biotinylated universal primer then hybridized and
acted as a reverse primer thereby introducing a 3' biotin capture moiety
into the molecule. The amplification protocol results in a 5'-biotinylated
double stranded DNA amplicon and dramatically reduces the cost of high
throughput genotyping by eliminating the need to 5' biotin label each
forward primer used in genotyping. Thermal cycling was performed in
0.2 mL tubes or a 96 well plate using an MJ Research Thermal Cycler
(Waltham, MA) (calculated temperature) with the following cycling
parameters: 94°C for 5 min; 45 cycles: 94°C for 20 sec, 56°C for 30
sec, 72°C for 60 sec; 72°C for 3 min.
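
The arithmetic behind the pooled template and the cycling program can be checked with a short sketch. The function names are hypothetical, and the program duration counts only the programmed hold times; ramp times between temperatures depend on the instrument and are ignored here.

```python
from typing import Sequence


def per_individual_ng(total_ng: float, n_individuals: int) -> float:
    """DNA contributed by each individual when the pool is mixed in equal amounts."""
    return total_ng / n_individuals


def program_seconds(initial_s: int, cycles: int,
                    step_seconds: Sequence[int], final_s: int) -> int:
    """Total programmed hold time of a thermal cycling program (no ramping)."""
    return initial_s + cycles * sum(step_seconds) + final_s


each_ng = per_individual_ng(100, 200)                # 100 ng pooled from 200 women
hold_s = program_seconds(5 * 60, 45, (20, 30, 60), 3 * 60)
hold_min = hold_s / 60
```

With 100 ng of pooled DNA from 200 individuals, each individual contributes 0.5 ng, which agrees with the amount stated above; the programmed hold time comes to about 90 minutes.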
Immobilization of DNA

The 50 µl PCR reaction was added to 25 µl of streptavidin coated
magnetic beads (Dynal, Lake Success, NY) prewashed three times and
resuspended in 1 M NH4Cl, 0.06 M NH4OH. The PCR amplicons were
allowed to bind to the beads for 15 minutes at room temperature. The
beads were then collected with a magnet and the supernatant containing
unbound DNA was removed. The unbound strand was released from the
double stranded amplicons by incubation in 100 mM NaOH and washing
of the beads three times with 10 mM Tris pH 8.0.

Genotyping

The frequency of the alleles at position 86 in the COX6B gene was
measured using the MassEXTEND™ assay and MALDI-TOF. The SNP
identified at position 86 of COX6B in the GenBank sequence is
represented as a C to T transition. The MassEXTEND™ assay used
detected the sequence of the complementary strand, thus the SNP was
represented as G to A in the primer extension products. The DNA coated
magnetic beads were resuspended in 26 mM Tris-HCl pH 9.5, 6.5 mM
MgCl2 and 50 mM each of dTTPs and 50 mM each of ddCTP, ddATP,


WO 02/072604 



PCT/US02/06728 


-75- 

ddGTP, 2.5 U of a thermostable DNA polymerase (Amersham Pharmacia
Biotech, Piscataway, NJ) and 20 pmoles of a template specific
oligonucleotide primer 5'-AATCAAGAACTACAAGAC-3' (SEQ ID NO.: 5)
(Operon, Alameda, CA). Primer extension occurred with three cycles of
oligonucleotide primer hybridization and extension. The extension
products were analyzed after denaturation from the template with 50 mM
NH4Cl and transfer of 150 nl of each sample to a silicon chip preloaded
with 150 nl of H3PA (3-hydroxy picolinic acid) (Sigma Aldrich, St Louis,
MO) matrix material. The sample material was allowed to crystallize and
analyzed by MALDI-TOF (Bruker Daltonics, Billerica, MA; PerSeptive,
Foster City, CA). The mass of the primer used in the MassEXTEND™
reaction was 5493.70 daltons. The predominant allele is extended by the
addition of ddC, giving an extension product with a mass of 5766.90
daltons. The allelic variant results in the addition of dT and ddG to the
primer to produce an extension product having a mass of 6111.10 daltons.
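
The extension product masses quoted above can be reproduced with a short sketch. The per-nucleotide mass increments below are assumptions: they are approximate residue masses chosen to agree with the differences between the masses stated in the text, and the helper names are hypothetical. The `complement` helper only illustrates why a C to T change on the GenBank strand is read as G to A on the complementary strand.

```python
# Approximate mass added per incorporated nucleotide, in daltons (assumed values).
ADDED_MASS = {"dT": 304.20, "ddC": 273.20, "ddG": 313.20}

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def complement(base: str) -> str:
    """Watson-Crick complement of a single base."""
    return COMPLEMENT[base]


def extension_mass(primer_mass: float, added: list) -> float:
    """Mass of the primer after incorporating the listed nucleotides."""
    return round(primer_mass + sum(ADDED_MASS[n] for n in added), 2)


PRIMER_MASS = 5493.70  # SEQ ID NO.: 5 primer mass as stated in the text

predominant = extension_mass(PRIMER_MASS, ["ddC"])    # single ddC addition
variant = extension_mass(PRIMER_MASS, ["dT", "ddG"])  # dT followed by ddG

# C to T on the GenBank strand corresponds to G to A on the complementary strand.
snp_on_complement = (complement("C"), complement("T"))
```

The two products differ by 344.20 daltons under these assumed increments, which is what allows the alleles to be resolved as separate peaks by MALDI-TOF.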

In addition to being analyzed as part of a pool, each individual
sample (0.5 ng) was amplified as described above and analyzed
individually using a MassEXTEND™ reaction as described above.

Pooled populations of women (200 women per pool) with high
cholesterol (pool 2) showed an increase in the frequency of the A allele at
nucleotide position 86 of COX6B as compared with those with low levels
of cholesterol (pool 1) (see Fig. 1). The association of this allelic variant
of the COX6B gene with high cholesterol gave a statistically significant
value of 14.30 using a 1-degree-of-freedom chi-squared test of
association. In other words, the increase from 2.75% to 9.05% is
significant, with a p value of 0.000156 (see Fig. 1). The genotype of
each of the individuals in the pooled population was also determined by
carrying out MassEXTEND™ reactions on each DNA sample individually.
These analyses confirmed the pooling data, showing that there was an
increase in the frequency of the A allele from 2.27% to 9.93%
(p = 0.0000061). The genotypes in pool 2 showed a decrease in the
homozygous GG genotype from 95.4% to 82.35% and an increase in the
heterozygous GA genotype from 4.55% to 15.44%. None of the
individuals with low levels of serum cholesterol exhibited the homozygous
AA genotype.
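
The pooled statistics reported above can be approximately reproduced with a short sketch. It rests on assumptions that are not stated in the text: each pool of 200 women is treated as 400 alleles, the test is taken to be a Pearson chi-squared test on a 2x2 table without continuity correction, and the AA genotype frequency in pool 2 is inferred as the remainder after the GG and GA percentages. All function names are hypothetical.

```python
import math


def allele_counts(variant_freq_pct: float, n_individuals: int):
    """Convert a variant allele frequency (percent) into (variant, other)
    allele counts, assuming two alleles per individual."""
    chromosomes = 2 * n_individuals
    variant = chromosomes * variant_freq_pct / 100.0
    return variant, chromosomes - variant


def chi_squared_2x2(a: float, b: float, c: float, d: float) -> float:
    """Pearson chi-squared statistic for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


def p_value_1df(chi2: float) -> float:
    """Upper-tail probability of a chi-squared variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2.0))


def variant_allele_freq_pct(het_pct: float, hom_variant_pct: float) -> float:
    """Variant allele frequency from genotype percentages."""
    return het_pct / 2.0 + hom_variant_pct


# Pooled data: A allele frequency of 2.75% (pool 1) versus 9.05% (pool 2).
low = allele_counts(2.75, 200)
high = allele_counts(9.05, 200)
chi2 = chi_squared_2x2(low[0], low[1], high[0], high[1])  # about 14.30
p = p_value_1df(chi2)                                     # about 0.000156

# Individual genotypes: no AA in pool 1; AA in pool 2 inferred as the remainder.
pool1_a = variant_allele_freq_pct(4.55, 0.0)
pool2_aa = 100.0 - 82.35 - 15.44
pool2_a = variant_allele_freq_pct(15.44, pool2_aa)        # about 9.93
```

Under these assumptions the pooled comparison gives a statistic and p value that agree with the reported 14.30 and 0.000156, and the genotype percentages are consistent with the reported A allele frequencies. The p value reported for the individually genotyped samples is not recomputed here because the number of individuals genotyped is not given.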

EXAMPLE 3 

Detection of an Association Between an SNP at Position 2577 of the 
Human GPI-1 Gene and Low HDL 

DNA samples (as prepared in Example 1), representing 200
women, from pool 3 (low levels of HDL) and pool 4 (high levels of HDL)
were amplified and analyzed for genetic differences using a
MassEXTEND™ detection method. For each pool, SNPs were examined
throughout the genome to detect differences in allelic frequency of variant
alleles between the pools.

PCR Amplification of Samples from Pools 3 and 4 

PCR primers were synthesized by Operon (Alameda, CA) using
phosphoramidite chemistry. Amplification of the GPI-1 target sequence
was carried out in a single 50 µl PCR reaction with 100 ng of pooled human
genomic DNA (200 samples), obtained as described in Example 1, taken
from samples in pool 3 or pool 4, although amounts ranging from 100 ng
to 1 µg could be used. Individual DNA samples were present in the pool
in equal amounts, with a final amount of 0.5 ng each. Each reaction
contained 1X PCR buffer (Qiagen, Valencia, CA), 200 µM dNTPs, 1 U
Hotstar Taq polymerase (Qiagen, Valencia, CA), 4 mM MgCl2, 25 pmoles of
the forward primer containing both the universal primer sequence and the
target specific short sequence 5'-AGCAGGGCTTCCTCCTTC-3' (SEQ ID NO.: 8),
2 pmoles of the long primer
5'-AGCGGATAACAATTTCACACAGGTGACCCAGCCGTACCTATTC-3' (SEQ ID NO.: 9)
and 10 pmoles of a biotinylated universal primer complementary to the
5' end of the PCR amplicon, 5'-AGCGGATAACAATTTCACACAGG-3'
(SEQ ID NO.: 121). After an initial round of amplification of the target
with the specific forward (long) and reverse (short) primers, the 5'
biotinylated universal primer then hybridized and acted as a reverse
primer, thereby introducing a 3' biotin capture moiety into the molecule.
The amplification protocol results in a 5'-biotinylated double stranded
DNA amplicon and dramatically reduces the cost of high throughput
genotyping by eliminating the need to 5' biotin label each forward primer
used in genotyping. Thermal cycling was performed in 0.2 mL tubes or a
96 well plate using an MJ Research Thermal Cycler (Waltham, MA)
(calculated temperature) with the following cycling parameters: 94°C for
5 min; 45 cycles of 94°C for 20 sec, 56°C for 30 sec, 72°C for 60 sec;
72°C for 3 min.
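
The cycling parameters stated above can be written down as a small data structure, which makes the programmed run time easy to check. The following is only an illustrative sketch: the names Step, CYCLE and total_hold_seconds are hypothetical, and the total counts programmed hold times only, not the ramp times of any particular thermal cycler.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int   # block temperature in degrees Celsius
    seconds: int  # programmed hold time in seconds


# Cycling parameters as stated in the text.
INITIAL_DENATURATION = Step(94, 5 * 60)
CYCLE = (Step(94, 20), Step(56, 30), Step(72, 60))
N_CYCLES = 45
FINAL_EXTENSION = Step(72, 3 * 60)


def total_hold_seconds() -> int:
    """Sum of programmed hold times (ramp times are not included)."""
    per_cycle = sum(step.seconds for step in CYCLE)
    return (INITIAL_DENATURATION.seconds
            + N_CYCLES * per_cycle
            + FINAL_EXTENSION.seconds)


if __name__ == "__main__":
    seconds = total_hold_seconds()
    print(f"{seconds} s = {seconds / 60:.1f} min of programmed hold time")
```

Under these parameters the programmed hold time comes to 5430 seconds (about 90.5 minutes).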
Immobilization of DNA 

The 50 µl PCR reaction was added to 25 µl of streptavidin coated
magnetic beads (Dynal, Lake Success, NY) prewashed three times and
resuspended in 1 M NH4Cl, 0.06 M NH4OH. The PCR amplicons were
allowed to bind to the beads for 15 minutes at room temperature. The
beads were then collected with a magnet and the supernatant containing
unbound DNA was removed. The unbound strand was released from the
double stranded amplicons by incubation in 100 mM NaOH and washing
of the beads three times with 10 mM Tris pH 8.0.

Genotyping

The frequency of the alleles at position 2577 in the GPI-1 gene was
measured using the MassEXTEND™ assay and MALDI-TOF. The SNP
identified at position 2577 of GPI-1 in the GenBank sequence is
represented as a G to A transition. The MassEXTEND™ assay used
detected the complementary strand, thus the SNP was represented as C to T
in the primer extension products. The DNA coated magnetic beads were
resuspended in 26 mM Tris-HCl pH 9.5, 6.5 mM MgCl2, 50 mM each of
dTTP, ddCTP, ddATP and ddGTP, 2.5 U of a thermostable DNA polymerase
(Amersham Pharmacia Biotech, Piscataway, NJ) and 20 pmoles of a
template specific oligonucleotide primer 5'-AAGGGAGACAGATTTGGC-3'
(SEQ ID NO.: 10) (Operon, Alameda, CA). Primer extension occurred with
three cycles of oligonucleotide primer hybridization and extension. The
extension products were analyzed after denaturation from the template
with 50 mM NH4Cl and transfer of 150 nl of each sample to a silicon chip
preloaded with 150 nl of 3-HPA matrix material. The sample material was
allowed to crystallize and was analyzed by MALDI-TOF (Bruker Daltonics,
Billerica, MA; PerSeptive, Foster City, CA). The mass of the primer used
in the MassEXTEND™ reaction was 5612.70 daltons. The predominant
allele is extended by the addition of ddC, giving an extension product
with a mass of 5885.90 daltons. The allelic variant results in the
addition of dT and ddG to the primer to produce an extension product
having a mass of 6230.10 daltons.
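
The allele assignment implied by these masses can be illustrated with a short sketch. It uses only the three masses stated above; the function names and the 2 dalton matching tolerance are hypothetical choices made for illustration and are not taken from the assay.

```python
# Masses in daltons, as stated in the text.
PRIMER_MASS = 5612.70
EXPECTED_PRODUCTS = {
    "C": 5885.90,  # predominant allele: primer + ddC
    "T": 6230.10,  # allelic variant: primer + dT + ddG
}


def mass_shift(product_mass: float) -> float:
    """Mass added to the primer by extension, rounded to 0.01 dalton."""
    return round(product_mass - PRIMER_MASS, 2)


def call_peak(observed_mass: float, tolerance: float = 2.0):
    """Return the allele whose expected product mass lies within the
    (assumed) tolerance of an observed peak, or None if neither does."""
    allele = min(EXPECTED_PRODUCTS,
                 key=lambda a: abs(EXPECTED_PRODUCTS[a] - observed_mass))
    if abs(EXPECTED_PRODUCTS[allele] - observed_mass) <= tolerance:
        return allele
    return None


if __name__ == "__main__":
    for allele, mass in EXPECTED_PRODUCTS.items():
        print(allele, mass, "shift:", mass_shift(mass))
```

The two products differ from the unextended primer by 273.20 and 617.40 daltons respectively, so the two alleles are separated by 344.20 daltons in the spectrum.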

In addition to being analyzed as a pool, each individual sample (0.5
ng) was amplified as described above and analyzed individually using the
MassEXTEND™ reaction as described above.

Pooled populations of women (200 women per pool) with low HDL
(pool 3) showed an increase in the T allele of 11.33% at nucleotide
position 2577 as compared with those with high levels of HDL (pool 4).
The association of this allelic variant of the GPI-1 gene with low HDL
gave a statistically significant value of 15.04 using a 1-degree-of-freedom
chi-squared test of association. In other words, the increase from 16.23%
to 27.57% is significant, with a p value of 0.0001064 (see Fig. 2). The
genotype of each of the individuals in the pooled population was also
determined by carrying out individual MassEXTEND™ reactions on
individual DNA samples. These analyses confirmed the pooling data,
showing an increase in the frequency of the T allele from 19.49% to
26.1% (p = 0.024). The measured genotypes in pool 3 showed a decrease
in the homozygous CC genotype from 65.24% to 54.21% and an increase in
the heterozygous CT genotype from 30.51% to 39.25%. The homozygous TT
genotype increased by 2.3%.
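
The statistics reported above can be reproduced approximately with a short calculation. The sketch below rests on assumptions that are not stated in the text: that each pool of 200 women contributes 400 chromosomes, and that the pooled allele frequencies can be treated as allele counts in a 2x2 table. The function names are hypothetical.

```python
import math


def chi2_allele_test(freq_a, freq_b, chromosomes_a=400, chromosomes_b=400):
    """1-degree-of-freedom Pearson chi-squared test on a 2x2 table of
    allele counts reconstructed from two allele frequencies.
    The default of 400 chromosomes per pool is an assumption."""
    variant_a = freq_a * chromosomes_a
    variant_b = freq_b * chromosomes_b
    observed = [[variant_a, chromosomes_a - variant_a],
                [variant_b, chromosomes_b - variant_b]]
    row_totals = [chromosomes_a, chromosomes_b]
    col_totals = [observed[0][j] + observed[1][j] for j in range(2)]
    total = chromosomes_a + chromosomes_b
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (observed[i][j] - expected) ** 2 / expected
    # Upper tail of the chi-squared distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


def t_allele_percent(cc_percent, ct_percent):
    """T allele frequency (percent) from CC and CT genotype percentages,
    taking TT as the remainder."""
    tt_percent = 100.0 - cc_percent - ct_percent
    return tt_percent + ct_percent / 2.0


if __name__ == "__main__":
    chi2, p = chi2_allele_test(0.2757, 0.1623)
    print(f"chi-squared = {chi2:.2f}, p = {p:.2e}")
    print(f"high HDL T allele: {t_allele_percent(65.24, 30.51):.2f}%")
    print(f"low HDL T allele:  {t_allele_percent(54.21, 39.25):.2f}%")
```

Under these assumptions the pooled frequencies of 27.57% and 16.23% give a chi-squared value of about 15.04 with a p value near 0.0001, in line with the reported figures, and the genotype percentages give T allele frequencies close to the reported 19.49% and 26.1%.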

Since modifications will be apparent to those of skill in this art, it is
intended that this invention be limited only by the scope of the appended
claims.

WHAT IS CLAIMED:

1. A method for detecting the presence or absence in a subject
of at least one allelic variant of a polymorphic region of a gene associated
with cardiovascular disease, comprising:

the step of detecting the presence or absence of an allelic variant
of a polymorphic region of a cytochrome C oxidase subunit VIb (COX6B)
gene of the subject that is associated with high serum cholesterol or an
allelic variant of a polymorphic region of a N-acetylglucosaminyl
transferase component (GPI-1) gene of the subject that is associated with
low serum high density lipoprotein (HDL).

2. The method of claim 1, wherein the allelic variant is of a
polymorphic region of the cytochrome C oxidase subunit VIb (COX6B)
gene.

3. The method of claim 1, wherein the allelic variant is of a
polymorphic region of the N-acetylglucosaminyl transferase component
(GPI-1) gene.

4. The method of any of claims 1-3, further comprising
detecting the presence or absence in a subject of at least one allelic
variant of another gene associated with cardiovascular disease.

5. The method of claim 4, wherein the other gene is selected from
the group consisting of cholesterol ester transfer protein, plasma (CETP);
apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1); apolipoprotein
E (APO E); apolipoprotein B (APO B); apolipoprotein C-III (APO C3); a
gene encoding lipoprotein lipase (LPL); ATP-binding cassette transporter
(ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding hepatic
lipase, E-selectin, G protein beta 3 subunit and angiotensin II type 1
receptor gene.

6. The method of claim 2 or claim 3, wherein the polymorphic
region is a single nucleotide polymorphism (SNP).

7. The method of any of claims 1-6, wherein the detection is
effected by detecting a light producing reagent.

8. The method of claim 6, wherein the SNP is at position 86 of
the cytochrome C oxidase subunit VIb (COX6B) gene coding sequence
and the allelic variant is represented by a T nucleotide in the sense strand
or an A nucleotide in the corresponding position in the antisense strand.

9. The method of claim 6, wherein the SNP is at position 2577
of the N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene
sequence and the allelic variant is represented by an A nucleotide in the
sense strand or a T nucleotide in the corresponding position in the
antisense strand.

10. The method of any of claims 1-3, wherein the detecting step
is by a method selected from the group consisting of allele specific
hybridization, primer specific extension, oligonucleotide ligation assay,
restriction enzyme site analysis and single-stranded conformation
polymorphism analysis.

11. The method of claim 8, further comprising:

(a) hybridizing a target nucleic acid comprising a
cytochrome C oxidase subunit VIb (COX6B)-encoding nucleic acid
or fragment thereof with a nucleic acid primer that hybridizes
adjacent to nucleotide 86 of the coding sequence of the COX6B
gene;

(b) extending the nucleic acid primer using the target
nucleic acid as a template; and

(c) determining the mass of the extended primer to
identify the nucleotide present at position 86, thereby determining
the presence or absence of the allelic variant.

12. The method of claim 9, further comprising:

(a) hybridizing a target nucleic acid comprising a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1)-encoding
nucleic acid or fragment thereof with a nucleic acid primer that
hybridizes adjacent to nucleotide 2577 of the GPI-1 gene;

(b) extending the nucleic acid primer using the target
nucleic acid as a template; and

(c) determining the mass of the extended primer to
identify the nucleotide present at position 2577, thereby
determining the presence or absence of the allelic variant.

13. The method of any of claims 1-12, wherein the detecting
step comprises mass spectrometry.

14. The method of any of claims 1-6 and 8-12, wherein the
detection is effected by detecting a signal moiety selected from the group
consisting of radioisotopes, enzymes, antigens, antibodies,
spectrophotometric reagents, chemiluminescent reagents and fluorescent
reagents.

15. The method of claim 11 or claim 12, wherein the nucleic acid
primer is extended in the presence of at least one dideoxynucleotide.

16. The method of claim 15 or claim 16, wherein the
dideoxynucleotide is dideoxyguanosine (ddG).

17. The method of claim 11, wherein the primer is extended in
the presence of at least two dideoxynucleotides and the dideoxynucleotides
are dideoxyguanosine (ddG) and dideoxycytosine (ddC).

18. The method of claim 12, wherein the primer is extended in
the presence of at least two dideoxynucleotides and the
dideoxynucleotides are dideoxyguanosine (ddG) and dideoxycytosine (ddC).

19. A method for indicating a predisposition to cardiovascular
disease in a subject, comprising:

the step of detecting in a target nucleic acid obtained from the
subject the presence or absence of at least one allelic variant of
polymorphic regions of a cytochrome C oxidase subunit VIb (COX6B)
gene associated with high serum cholesterol or at least one allelic variant
of polymorphic regions of a N-acetylglucosaminyl transferase component
GPI-1 (GPI-1) gene associated with low serum HDL, wherein the presence
of an allelic variant is indicative of a predisposition to cardiovascular
disease compared to a subject who does not comprise the allelic variant.

20. The method of claim 19, wherein the allelic variant is of a
polymorphic region of the cytochrome C oxidase subunit VIb (COX6B)
gene.

21. The method of claim 19, wherein the allelic variant is of a
polymorphic region of the N-acetylglucosaminyl transferase component
GPI-1 (GPI-1) gene.

22. The method of claim 20 or claim 21, wherein the
polymorphic region is a single nucleotide polymorphism (SNP).

23. The method of claim 22, wherein the SNP is at position 86
of the cytochrome C oxidase subunit VIb (COX6B) gene coding sequence
and the allelic variant is represented by a T nucleotide in the sense strand
or an A nucleotide in the corresponding position in the antisense strand.

24. The method of claim 22, wherein the SNP is at position
2577 of the N-acetylglucosaminyl transferase component GPI-1 (GPI-1)
gene sequence and the allelic variant is represented by an A nucleotide in
the sense strand or a T nucleotide in the corresponding position in the
antisense strand.

25. The method of claim 19, wherein the detecting step is by a
method selected from the group consisting of allele specific hybridization,
primer specific extension, oligonucleotide ligation assay, restriction
enzyme site analysis and single-stranded conformation polymorphism
analysis.

26. The method of claim 23, further comprising:

(a) hybridizing a target nucleic acid comprising a cytochrome C
oxidase subunit VIb (COX6B)-encoding nucleic acid or fragment thereof
with a nucleic acid primer that hybridizes adjacent to nucleotide 86 of the
coding sequence of the COX6B gene;

(b) extending the nucleic acid primer using the target nucleic
acid as a template; and

(c) determining the mass of the extended primer to identify the
nucleotide present at position 86, thereby determining the presence or
absence of the allelic variant.

27. The method of claim 24, further comprising:

(a) hybridizing a target nucleic acid comprising a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1)-encoding nucleic
acid or fragment thereof with a nucleic acid primer that hybridizes
adjacent to nucleotide 2577 of the GPI-1 gene;

(b) extending the nucleic acid primer using the target nucleic
acid as a template; and

(c) determining the mass of the extended primer to identify the
nucleotide present at position 2577, thereby determining the presence or
absence of the allelic variant.

28. The method of claim 19, wherein the detecting step 
comprises mass spectrometry. 

29. The method of claim 19, wherein the detection is effected by
detecting a signal moiety selected from the group consisting of:
radioisotopes, enzymes, antigens, antibodies, spectrophotometric
reagents, chemiluminescent reagents, fluorescent reagents and other light
producing reagents.

30. The method of claim 19, further comprising detecting the
presence or absence of at least one allelic variant of polymorphic regions
of another gene associated with cardiovascular disease, wherein the
presence of the two allelic variants is associated with a predisposition to
cardiovascular disease compared to a subject who does not comprise the
combination of allelic variants.

31. The method of claim 30, wherein the other gene is selected
from the group consisting of cholesterol ester transfer protein, plasma
(CETP); apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1);
apolipoprotein E (APO E); apolipoprotein B (APO B); apolipoprotein C-III
(APO C3); a gene encoding lipoprotein lipase (LPL); ATP-binding cassette
transporter (ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding
hepatic lipase, E-selectin, G protein beta 3 subunit and angiotensin II type
1 receptor gene.

32. The method of claim 30, wherein the two allelic variants are of
the cytochrome C oxidase subunit VIb (COX6B) gene and the
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene.

33. A kit comprising: 

(a) at least one probe specific for a polymorphic region of
a human gene selected from the group consisting of cytochrome C
oxidase subunit VIb (COX6B); N-acetylglucosaminyl transferase
component GPI-1 (GPI-1); cholesterol ester transfer protein, plasma
(CETP); apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1);
apolipoprotein E (APO E); apolipoprotein B (APO B); apolipoprotein
C-III (APO C3); a gene encoding lipoprotein lipase (LPL);
ATP-binding cassette transporter (ABC 1); paraoxonase 1 (PON 1);
paraoxonase 2 (PON 2); 5,10-methylenetetrahydrofolate
reductase (MTHFR); a gene encoding hepatic lipase, E-selectin, G
protein beta 3 subunit and angiotensin II type 1 receptor gene; and

(b) instructions for use. 

34. A method of screening for biologically active agents that
modulate serum cholesterol, comprising:

(a) combining a candidate agent with a cell comprising a
nucleotide sequence encoding an allelic variant of a cytochrome C
oxidase subunit VIb (COX6B) gene associated with high levels of
serum cholesterol and operably linked to a promoter such that the
nucleotide sequence is expressed as a COX6B protein in the cell;
and

(b) determining the effect of the agent upon the
expression and/or activity of the COX6B protein.

35. A method of screening for biologically active agents that 
modulate serum cholesterol, comprising: 

(a) combining a candidate agent with a transgenic mouse
comprising a transgenic nucleotide sequence stably integrated into
the genome of the mouse, wherein the transgenic nucleotide
sequence encodes an allelic variant of a cytochrome C oxidase
subunit VIb (COX6B) gene associated with high levels of serum
cholesterol and operably linked to a promoter, wherein the
transgenic nucleotide sequence is expressed and the transgenic
animal develops a high level of serum cholesterol; and

(b) determining the effect of the agent upon the serum
cholesterol level.

36. The method of claim 34 or claim 37 wherein the allelic
variant is at position 86 of the cytochrome C oxidase subunit VIb
(COX6B) gene.

37. A method of screening for biologically active agents that 
modulate serum high density lipoprotein (HDL), comprising: 

(a) combining a candidate agent with a cell comprising a
nucleotide sequence encoding an allelic variant of a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene
associated with low levels of serum HDL and operably linked to a
promoter such that the nucleotide sequence is expressed as a
GPI-1 protein in the cell; and

(b) determining the effect of the agent upon the
expression and/or activity of the GPI-1 protein.

38. A method of screening for biologically active agents that 
modulate serum high density lipoprotein (HDL), comprising: 

(a) combining a candidate agent with a transgenic mouse
comprising a transgenic nucleotide sequence stably integrated into
the genome of the mouse encoding an allelic variant of a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene
associated with low levels of serum HDL operably linked to a
promoter, wherein the transgenic nucleotide sequence is expressed
and the transgenic animal develops a low level of serum HDL; and

(b) determining the effect of the agent upon the serum
HDL level.

39. The method of claim 37 or claim 38, wherein the allelic 
variant is at position 2577 of the N-acetylglucosaminyl transferase 
component GPI-1 (GPI-1) gene. 

40. A method for predicting a response of a subject to a
cardiovascular drug, comprising:

detecting the presence or absence of at least one allelic variant of a
cytochrome C oxidase subunit VIb (COX6B) gene of the subject
associated with high serum cholesterol or at least one allelic variant of a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene of the
subject associated with low serum high density lipoprotein (HDL);

wherein the presence of at least one allelic variant is indicative of a
positive response.

41. The method of claim 40, wherein the allelic variant is of the
cytochrome C oxidase subunit VIb (COX6B) gene.

42. The method of claim 40, wherein the allelic variant is of the
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene.

43. A method for predicting a response of a subject to a 
cardiovascular drug, comprising: 

detecting the presence or absence of at least one allelic variant of a
cytochrome C oxidase subunit VIb (COX6B) gene of the subject
associated with high serum cholesterol; and

detecting the presence or absence of at least one allelic variant
of a N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene of
the subject associated with low serum high density lipoprotein (HDL);

wherein the presence of at least one allelic variant of the COX6B gene
and at least one allelic variant of the GPI-1 gene is indicative of a positive
response.

44. A method for predicting a response of a subject to a
biologically active agent that modulates serum cholesterol, comprising:

detecting the presence or absence of at least one allelic variant of a
cytochrome C oxidase subunit VIb (COX6B) gene of the subject
associated with high cholesterol;

wherein the presence of at least one allelic variant is indicative of a
positive response.

45. A method for predicting a response of a subject to a 
biologically active agent that modulates serum cholesterol, comprising: 

detecting the presence or absence of at least one allelic variant of a
cytochrome C oxidase subunit VIb (COX6B) gene of the subject
associated with high cholesterol; and

detecting the presence or absence of an allelic variant of at least
one other gene of the subject associated with cardiovascular disease,
wherein the presence of both allelic variants is indicative of a positive
response.

46. The method of claim 44 or claim 45, wherein the allelic
variant of the cytochrome C oxidase subunit VIb (COX6B) gene is at
position 86.

47. A method for predicting a response of a subject to a
biologically active agent that modulates serum high density lipoprotein
(HDL), comprising:

detecting the presence or absence of at least one allelic variant of a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene of the
subject associated with low HDL; wherein the presence of an allelic
variant is indicative of a positive response.

48. A method for predicting a response of a subject to a 
biologically active agent that modulates serum high density lipoprotein 
(HDL) levels, comprising: 

(a) detecting the presence or absence of at least one
allelic variant of a N-acetylglucosaminyl transferase component
GPI-1 (GPI-1) gene associated with low HDL of the subject; and

(b) detecting the presence or absence of an allelic variant
in at least one other gene of the subject associated with cardiovascular
disease, wherein the presence of both allelic variants is indicative
of a positive response.

49. The method of claim 47 or claim 48, wherein the allelic 
variant of a N-acetylglucosaminyl transferase component GPI-1 (GPI-1) 
gene is at position 2577. 

50. The method of claim 45 or 48, wherein the other gene
associated with cardiovascular disease is selected from the group of
genes consisting of N-acetylglucosaminyl transferase component GPI-1
(GPI-1) gene, cholesterol ester transfer protein, plasma (CETP);
apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1); apolipoprotein
E (APO E); apolipoprotein B (APO B); apolipoprotein C-III (APO C3); a
gene encoding lipoprotein lipase (LPL); ATP-binding cassette transporter
(ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding hepatic
lipase, E-selectin, G protein beta 3 subunit and angiotensin II type 1
receptor gene.


51. A primer or probe that specifically hybridizes adjacent to or
at a polymorphic region of a cytochrome C oxidase subunit VIb (COX6B)
gene associated with high serum cholesterol in combination with a primer
or probe that specifically hybridizes adjacent to or at a polymorphic region
of a N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene
associated with low HDL.

52. The primers or probes of claim 51, further comprising
primers or probes that specifically hybridize adjacent to or at a
polymorphic region of another gene associated with cardiovascular
disease.

53. The primers or probes of claim 51, wherein the polymorphic
region of the cytochrome C oxidase subunit VIb (COX6B) gene comprises
nucleotide 86 of the coding strand and the polymorphic region of the
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene comprises
nucleotide 2577.

54. The primers or probes of claim 52, wherein the
other gene associated with cardiovascular disease is selected from the
group of genes consisting of cholesterol ester transfer protein, plasma
(CETP); apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1);
apolipoprotein E (APO E); apolipoprotein B (APO B); apolipoprotein C-III
(APO C3); a gene encoding lipoprotein lipase (LPL); ATP-binding cassette
transporter (ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding
hepatic lipase, E-selectin, G protein beta 3 subunit and angiotensin II type
1 receptor gene.

55. A kit for indicating whether a subject has a predisposition to
developing cardiovascular disease, comprising:

(a) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a cytochrome C oxidase
subunit VIb (COX6B) gene associated with high serum cholesterol;
and

(b) optionally instructions for use.

56. The kit of claim 55, wherein the polymorphic region
comprises nucleotide 86 of the coding strand.

57. A kit for indicating whether a subject has a predisposition to
developing cardiovascular disease, comprising:

(a) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a cytochrome C oxidase
subunit VIb (COX6B) gene associated with high cholesterol;

(b) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of another gene associated
with cardiovascular disease; and

(c) optionally instructions for use.

58. The kit of claim 57, wherein the other gene associated with
cardiovascular disease is selected from the group of genes consisting of
N-acetylglucosaminyl transferase component GPI-1 (GPI-1); cholesterol
ester transfer protein, plasma (CETP); apolipoprotein A-IV (APO A4);
apolipoprotein A-I (APO A1); apolipoprotein E (APO E); apolipoprotein B
(APO B); apolipoprotein C-III (APO C3); a gene encoding lipoprotein lipase
(LPL); ATP-binding cassette transporter (ABC 1); paraoxonase 1 (PON 1);
paraoxonase 2 (PON 2); 5,10-methylenetetrahydrofolate reductase
(MTHFR); a gene encoding hepatic lipase, E-selectin, G protein beta 3
subunit and angiotensin II type 1 receptor gene.

59. A kit for indicating whether a subject has a predisposition to
developing cardiovascular disease, comprising:

(a) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a N-acetylglucosaminyl
transferase component GPI-1 (GPI-1) gene associated with low
serum high density lipoprotein (HDL); and

(b) optionally instructions for use.

60. The kit of claim 59, wherein the polymorphic region 
comprises nucleotide 2577 of the coding strand. 

61. A kit for indicating whether a subject has a predisposition to
developing cardiovascular disease, comprising:

(a) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a N-acetylglucosaminyl
transferase component GPI-1 (GPI-1) gene associated with low
serum high density lipoprotein (HDL);

(b) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of another gene associated
with cardiovascular disease; and

(c) optionally instructions for use.

62. The kit of claim 61, wherein the other gene associated with
cardiovascular disease is selected from the group of genes consisting of
cytochrome C oxidase subunit VIb (COX6B); cholesterol ester transfer
protein, plasma (CETP); apolipoprotein A-IV (APO A4); apolipoprotein A-I
(APO A1); apolipoprotein E (APO E); apolipoprotein B (APO B);
apolipoprotein C-III (APO C3); a gene encoding lipoprotein lipase (LPL);
ATP-binding cassette transporter (ABC 1); paraoxonase 1 (PON 1);
paraoxonase 2 (PON 2); 5,10-methylenetetrahydrofolate reductase
(MTHFR); a gene encoding hepatic lipase, E-selectin, G protein beta 3
subunit and angiotensin II type 1 receptor gene.

63. A kit for indicating whether a subject has a predisposition to 
developing cardiovascular disease, comprising: 

(a) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a cytochrome C oxidase
subunit VIb (COX6B) gene associated with high cholesterol;

(b) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a N-acetylglucosaminyl
transferase component GPI-1 (GPI-1) gene associated with low
HDL; and

(c) optionally instructions for use. 

64. The kit of claim 63, further comprising at least one probe or
primer that specifically hybridizes adjacent to or at a polymorphic region
of another gene associated with cardiovascular disease.

65. The kit of claim 64, wherein the other gene associated with
cardiovascular disease is selected from the group of genes consisting of
cholesterol ester transfer protein, plasma (CETP); apolipoprotein A-IV
(APO A4); apolipoprotein A-I (APO A1); apolipoprotein E (APO E);
apolipoprotein B (APO B); apolipoprotein C-III (APO C3); a gene encoding
lipoprotein lipase (LPL); ATP-binding cassette transporter (ABC 1);
paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding hepatic
lipase, E-selectin, G protein beta 3 subunit and angiotensin II type 1
receptor gene.

66. A method of diagnosing a predisposition to cardiovascular 
disease in a human, said method comprising the steps of: 

(a) obtaining a biological sample from the human;

(b) isolating DNA from the biological sample; and

(c) detecting the presence or absence of at least one
allelic variant of a cytochrome C oxidase subunit VIb (COX6B) gene
in the DNA.

67. The method of claim 66, wherein at least one variant is a C
to T transition at position 86 of the cytochrome C oxidase subunit VIb
gene (COX6B) coding region.

68. The method of claim 66, further comprising the step of:
detecting the presence or absence of at least one allelic
variant of a second gene associated with cardiovascular disease.

69. The method of claim 68, wherein the second gene is
selected from the group consisting of human N-acetylglucosaminyl
transferase component GPI-1 (GPI-1); cholesterol ester transfer protein,
plasma (CETP); apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO
A1); apolipoprotein E (APO E); apolipoprotein B (APO B); apolipoprotein
C-III (APO C3); a gene encoding lipoprotein lipase (LPL); ATP-binding
cassette transporter (ABC 1); paraoxonase 1 (PON 1); paraoxonase 2
(PON 2); 5,10-methylenetetrahydrofolate reductase (MTHFR); a gene
encoding hepatic lipase, E-selectin, G protein beta 3 subunit and
angiotensin II type 1 receptor gene.

70. The method of claim 68, wherein the detecting step is
performed by an assay selected from the group consisting of allele
specific hybridization, primer specific extension, oligonucleotide ligation,
restriction enzyme site analysis, and single-stranded conformation
polymorphism analysis.

71 . A method of diagnosing a predisposition to cardiovascular 
disease in a human, said method comprising the steps of: 

(a) obtaining a biological sample from the human;

(b) isolating DNA from the biological sample; and

(c) detecting the presence or absence of at least one
allelic variant of a N-acetylglucosaminyl transferase component
GPI-1 (GPI-1) gene in the DNA.

72. The method of claim 71, wherein the detecting step is
performed by an assay selected from the group consisting of allele
specific hybridization, primer specific extension, oligonucleotide ligation,
restriction enzyme site analysis, and single-stranded conformation
polymorphism analysis.

73. The method of claim 71, wherein at least one variant is a G
to A transition at position 2577 of a N-acetylglucosaminyl transferase
component GPI-1 (GPI-1) gene.

74. A method of determining a response of a human to a 
cardiovascular drug, said method comprising the steps of: 

(a) obtaining a biological sample from the human;

(b) isolating DNA from the biological sample; and

(c) detecting the presence or absence of at least one
allelic variant of a cytochrome C oxidase subunit VIb (COX6B) gene
in the DNA or at least one allelic variant of a N-acetylglucosaminyl
transferase component GPI-1 (GPI-1) gene in the DNA.

75. The method of claim 74, wherein the detecting step is
performed by an assay selected from the group consisting of allele
specific hybridization, primer specific extension, oligonucleotide ligation,
restriction enzyme site analysis, and single-stranded conformation
polymorphism analysis.

76. A microarray, comprising:

an isolated nucleic acid molecule comprising a sequence of
nucleotides of a polymorphic region from a human cytochrome C oxidase
subunit VIb (COX6B) gene linked to a solid support.

77. The microarray of claim 76, wherein the polymorphic region
comprises position 86 of the human cytochrome C oxidase subunit VIb
(COX6B) coding region.

78. A microarray, comprising: 

an isolated nucleic acid molecule comprising a sequence of 
nucleotides of a polymorphic region from a human 
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene linked to a 
solid support. 

79. The microarray of claim 78, wherein the polymorphic region 
comprises a locus selected from the group consisting of position 2577 of 
the human N-acetylglucosaminyl transferase component GPI-1 (GPI-1) 
gene, position 2829 of the human GPI-1 gene, position 2519 of the 
human GPI-1 gene, position 2289 of the human GPI-1 gene, position 
1938 of the human GPI-1 gene, position 1563 of the human GPI-1 gene, 
position 2656 of the human GPI-1 gene, and position 2664 of the human 
GPI-1 gene. 

80. The microarray of claim 91, wherein the polymorphic region 
comprises position 2577 of the human N-acetylglucosaminyl transferase 
component GPI-1 (GPI-1) gene. 
SEQUENCE LISTING 

<110> Braun, Andreas 
Bansal, Aruna 
Kleyn, Patrick 

<120> GENES AND POLYMORPHISMS ASSOCIATED WITH 
CARDIOVASCULAR DISEASE AND THEIR USE 


<130> 24736-2048PC 

<140> Unassigned 
<141> Herewith 

<140> 09/802,640 
<141> 03-09-2001 

<160> 122 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 439 

<212> DNA 

<213> Homo Sapien 

<220> 

<221> CDS 

<222> (45) . . . (305) 

<400> 1 

ttgagctgca ggttgaatcc ggggtgcctt taggattcag cacc atg gcg gaa gac 56 
Met Ala Glu Asp 
1 

atg gag acc aaa atc aag aac tac aag acc gcc cct ttt gac agc cgc 104 
Met Glu Thr Lys Ile Lys Asn Tyr Lys Thr Ala Pro Phe Asp Ser Arg 
5 10 15 20 

ttc ccc aac cag aac cag act aga aac tgc tgg cag aac tac ctg gac 152 
Phe Pro Asn Gln Asn Gln Thr Arg Asn Cys Trp Gln Asn Tyr Leu Asp 
25 30 35 

ttc cac cgc tgt cag aag gca atg acc gct aaa gga ggc gat atc tct 200 
Phe His Arg Cys Gln Lys Ala Met Thr Ala Lys Gly Gly Asp Ile Ser 
40 45 50 

gtg tgc gaa tgg tac cag cgt gtg tac cag tcc ctc tgc ccc aca tcc 248 
Val Cys Glu Trp Tyr Gln Arg Val Tyr Gln Ser Leu Cys Pro Thr Ser 
55 60 65 

tgg gtc aca gac tgg gat gag caa cgg gct gaa ggc acg ttt ccc ggg 296 
Trp Val Thr Asp Trp Asp Glu Gln Arg Ala Glu Gly Thr Phe Pro Gly 
70 75 80 

aag atc tga actggctgca tctccctttc ctctgtcctc catccttctc 345 
Lys Ile * 
85 

ccaggatggt gaagggggac ctggtaccca gtgatcccca ccccaggatc ctaaatcatg 405 
acttacctgc taataaaaac tcattggaaa agtg 439 
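
The CDS feature (45)...(305) of SEQ ID NO: 1 spans 261 nucleotides, that is 87 codons: 86 residues plus the stop codon, which agrees with the 86-residue length given for SEQ ID NO: 2. The following is a minimal sketch of that consistency check, assuming the standard genetic code and assuming the codons as read from the listing above, with OCR-damaged characters restored from the translation line. The names `CODON_TABLE`, `CDS_CODONS` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Codons of SEQ ID NO: 1, positions 45 to 305, transcribed from the listing
# (an assumption of this sketch, not an independent database record).
CDS_CODONS = """
atg gcg gaa gac
atg gag acc aaa atc aag aac tac aag acc gcc cct ttt gac agc cgc
ttc ccc aac cag aac cag act aga aac tgc tgg cag aac tac ctg gac
ttc cac cgc tgt cag aag gca atg acc gct aaa gga ggc gat atc tct
gtg tgc gaa tgg tac cag cgt gtg tac cag tcc ctc tgc ccc aca tcc
tgg gtc aca gac tgg gat gag caa cgg gct gaa ggc acg ttt ccc ggg
aag atc tga
""".split()


def translate(codons):
    """Translate codons to one-letter residues, stopping at the first stop codon."""
    residues = []
    for codon in codons:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


protein = translate(CDS_CODONS)
print(len(CDS_CODONS) * 3, len(protein))
print(protein)
```

The same check can be applied to any other CDS feature in the listing by substituting its codons.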

<210> 2 

<211> 86 

<212> PRT 

<213> Homo Sapien 

<400> 2 

Met Ala Glu Asp Met Glu Thr Lys Ile Lys Asn Tyr Lys Thr Ala Pro 
1 5 10 15 

Phe Asp Ser Arg Phe Pro Asn Gln Asn Gln Thr Arg Asn Cys Trp Gln 
20 25 30 

Asn Tyr Leu Asp Phe His Arg Cys Gln Lys Ala Met Thr Ala Lys Gly 
35 40 45 

Gly Asp Ile Ser Val Cys Glu Trp Tyr Gln Arg Val Tyr Gln Ser Leu 
50 55 60 

Cys Pro Thr Ser Trp Val Thr Asp Trp Asp Glu Gln Arg Ala Glu Gly 
65 70 75 80 

Thr Phe Pro Gly Lys Ile 
85 

<210> 3 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 3 

aggattcagc accatggc 18 

<210> 4 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 4 

agcggataac aatttcacac aggtagtctg gttctggttg ggg 43 

<210> 5 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MassExtend primer 

<400> 5 

aatcaagaac tacaagac 18 
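
To show how a primer of this kind relates to the polymorphic position, the sketch below locates the MassExtend primer of SEQ ID NO: 5 in the first 104 nucleotides of SEQ ID NO: 1 and reports the template position immediately 3' of the primer. It assumes that the primer anneals in the orientation shown and is extended by a single base at its 3' end, and it uses the template as read from the listing with OCR-damaged characters restored. The names `TEMPLATE`, `PRIMER` and `extension_site` are hypothetical.

```python
# Nucleotides 1 to 104 of SEQ ID NO: 1 as transcribed from the listing
# (an assumption of this sketch).
TEMPLATE = (
    "ttgagctgcaggttgaatccggggtgcctttaggattcagcacc"  # positions 1-44, 5' untranslated
    "atggcggaagac"                                   # positions 45-56
    "atggagaccaaaatcaagaactacaagacc"                 # positions 57-86
    "gcccctttt"                                      # positions 87-95
    "gacagccgc"                                      # positions 96-104
)
PRIMER = "aatcaagaactacaagac"  # SEQ ID NO: 5


def extension_site(template: str, primer: str):
    """Return (1-based position, base) of the nucleotide just 3' of the primer."""
    start = template.find(primer)
    if start < 0:
        raise ValueError("primer not found in template")
    site = start + len(primer)  # 0-based index of the first base after the primer
    if site >= len(template):
        raise ValueError("primer ends at the 3' end of the template")
    return site + 1, template[site]


position, base = extension_site(TEMPLATE, PRIMER)
print(position, base)
```

Under these assumptions the primer covers positions 68 to 85 and the extended base falls at position 86 of SEQ ID NO: 1, which appears to correspond to the position 86 recited for the COX6B polymorphic region.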


<210> 6 
<211> 2921 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (103) . . . (1848) 
<400> 6 

cagcgagcgc cgtcgtctgc ccgggcccgc ccatcggggt ccccaacccc atccggaccc 60 
cgccgcccga gcgcgcggcc ccggaagcac ccgcctcccg gc atg gtg ctc aag 114 
Met Val Leu Lys 
1 

gcc ttc ttc ccc acg tgc tgc gtc tcg gcg gac agc ggg ctg ctg gtg 162 
Ala Phe Phe Pro Thr Cys Cys Val Ser Ala Asp Ser Gly Leu Leu Val 
5 10 15 20 

gga cgg tgg gtg ccg gag cag agc agc gcc gtg gtc ctg gcg gtc ctg 210 
Gly Arg Trp Val Pro Glu Gln Ser Ser Ala Val Val Leu Ala Val Leu 
25 30 35 

cac ttt ccc ttc atc ccc atc cag gtc aag cag ctc ctg gcc cag gtg 258 
His Phe Pro Phe Ile Pro Ile Gln Val Lys Gln Leu Leu Ala Gln Val 
40 45 50 

cgg cag gcc agc cag gtg ggc gtg gcc gtg ctg ggc acc tgg tgc cac 306 
Arg Gln Ala Ser Gln Val Gly Val Ala Val Leu Gly Thr Trp Cys His 
55 60 65 

tgc cgg cag gag ccc gag gag agc ctg ggc cgc ttc ctg gag agc ctg 354 
Cys Arg Gln Glu Pro Glu Glu Ser Leu Gly Arg Phe Leu Glu Ser Leu 
70 75 80 

ggt gct gtc ttc ccc cat gag ccc tgg ctg cgg ctg tgc cgg gag aga 402 
Gly Ala Val Phe Pro His Glu Pro Trp Leu Arg Leu Cys Arg Glu Arg 
85 90 95 100 

ggc ggc acg ttc tgg agc tgc gag gcc acc cac cgg caa gcg ccc act 450 
Gly Gly Thr Phe Trp Ser Cys Glu Ala Thr His Arg Gln Ala Pro Thr 
105 110 115 

gcc ccc ggt gcc cct ggt gag gac cag gtc atg ctc atc ttc tat gac 498 
Ala Pro Gly Ala Pro Gly Glu Asp Gln Val Met Leu Ile Phe Tyr Asp 
120 125 130 

cag cgc cag gtg ttg ctg tca cag cta cac ctg ccc acc gtc ctg ccc 546 
Gln Arg Gln Val Leu Leu Ser Gln Leu His Leu Pro Thr Val Leu Pro 
135 140 145 

gac cgc cag gct gga gcc acc act gcc agc acg ggg ggc ctg gct gcc 594 
Asp Arg Gln Ala Gly Ala Thr Thr Ala Ser Thr Gly Gly Leu Ala Ala 
150 155 160 

gtc ttc gac acg gta gca cgc agt gag gtg ctc ttc cgc agt gac cgc 642 
Val Phe Asp Thr Val Ala Arg Ser Glu Val Leu Phe Arg Ser Asp Arg 
165 170 175 180 

ttt gat gag ggc ccc gtg cgg ctg agc cac tgg cag tcg gag ggc gtg 690 
Phe Asp Glu Gly Pro Val Arg Leu Ser His Trp Gln Ser Glu Gly Val 
185 190 195 

gag gcc agc atc ctc gcg gag ctg gcc agg cga gcc tcg gga ccc att 738 
Glu Ala Ser Ile Leu Ala Glu Leu Ala Arg Arg Ala Ser Gly Pro Ile 
200 205 210 

tgt ctg ctg ttg gcc agc ctg ctg tcg ctg gtc tca gct gtc agt gcc 786 
Cys Leu Leu Leu Ala Ser Leu Leu Ser Leu Val Ser Ala Val Ser Ala 
215 220 225 

tgc cga gtg ttc aag ctc tgg ccc ctg tcc ttc ctc ggg agc aaa ctc 834 
Cys Arg Val Phe Lys Leu Trp Pro Leu Ser Phe Leu Gly Ser Lys Leu 
230 235 240 

tcc acg tgc gaa cag ctc cgg cac cgg ctg gag cac ctc acg cta atc 882 
Ser Thr Cys Glu Gln Leu Arg His Arg Leu Glu His Leu Thr Leu Ile 
245 250 255 260 

ttc agt aca cgg aag gcg gag aac cct gcc cag ctg atg agg aag gcc 930 
Phe Ser Thr Arg Lys Ala Glu Asn Pro Ala Gln Leu Met Arg Lys Ala 
265 270 275 

aac acg gtg gcc tct gtg ctg ctg gac gtg gcc ctg ggc ctc atg ctg 978 
Asn Thr Val Ala Ser Val Leu Leu Asp Val Ala Leu Gly Leu Met Leu 
280 285 290 

ctg tcc tgg ctc cac ggg aga agc cgc atc ggg cat ctg gcc gac gcc 1026 
Leu Ser Trp Leu His Gly Arg Ser Arg Ile Gly His Leu Ala Asp Ala 
295 300 305 

ctc gtt cct gtg gct gac cac gtg gcc gag gag ctc cag cat ctg ctg 1074 
Leu Val Pro Val Ala Asp His Val Ala Glu Glu Leu Gln His Leu Leu 
310 315 320 

cag tgg ctg atg ggt gct ccc gcc ggg ctc aag atg aac cgt gca ctg 1122 
Gln Trp Leu Met Gly Ala Pro Ala Gly Leu Lys Met Asn Arg Ala Leu 
325 330 335 340 

gac cag gtg ctg ggc cgc ttc ttc ctc tac cac atc cac ctg tgg atc 1170 
Asp Gln Val Leu Gly Arg Phe Phe Leu Tyr His Ile His Leu Trp Ile 
345 350 355 

agc tac atc cac ctc atg tcc ccc ttc gtg gag cac atc ctt tgg cac 1218 
Ser Tyr Ile His Leu Met Ser Pro Phe Val Glu His Ile Leu Trp His 
360 365 370 

gtg ggc ctc tcg gcc tgc ctg ggc ctg acg gtg gcc ctg tcc ctc ctc 1266 
Val Gly Leu Ser Ala Cys Leu Gly Leu Thr Val Ala Leu Ser Leu Leu 
375 380 385 

tcg gac att atc gcc ctc ctc acc ttc cac atc tac tgc ttt tac gtc 1314 
Ser Asp Ile Ile Ala Leu Leu Thr Phe His Ile Tyr Cys Phe Tyr Val 
390 395 400 

tat gga gcc agg ctg tac tgc ctg aag atc cat ggc ctg tcc tca ctg 1362 
Tyr Gly Ala Arg Leu Tyr Cys Leu Lys Ile His Gly Leu Ser Ser Leu 
405 410 415 420 

tgg cgt ctg ttc cgg ggg aag aag tgg aac gtt ctg cgc cag cgc gtg 1410 
Trp Arg Leu Phe Arg Gly Lys Lys Trp Asn Val Leu Arg Gln Arg Val 
425 430 435 

gac tcc tgt tcc tat gac ctg gac cag ctg ttc atc ggg act ctg ctc 1458 
Asp Ser Cys Ser Tyr Asp Leu Asp Gln Leu Phe Ile Gly Thr Leu Leu 
440 445 450 

ttc acc atc ctg ctc ttc ctc ctg cct acc aca gcc ctg tac tac ctg 1506 
Phe Thr Ile Leu Leu Phe Leu Leu Pro Thr Thr Ala Leu Tyr Tyr Leu 
455 460 465 

gtg ttc acc ctg ctc cgg ctc ctg gtg gtc gcc gtg cag ggc ctg atc 1554 
Val Phe Thr Leu Leu Arg Leu Leu Val Val Ala Val Gln Gly Leu Ile 
470 475 480 

cat ctg ctg gtg gac ctc atc aac tcc ctg ccg ctg tac tca ctg ggt 1602 
His Leu Leu Val Asp Leu Ile Asn Ser Leu Pro Leu Tyr Ser Leu Gly 
485 490 495 500 

ctt cgg ctc tgc cgg ccc tac agg ctg gcg gct ggc gtg aag ttc cgt 1650 
Leu Arg Leu Cys Arg Pro Tyr Arg Leu Ala Ala Gly Val Lys Phe Arg 
505 510 515 

gtc ctc cgg cac gag gcc agc agg ccc ctc cgc ctc ctg atg cag ata 1698 
Val Leu Arg His Glu Ala Ser Arg Pro Leu Arg Leu Leu Met Gln Ile 
520 525 530 

aac cca ctg ccc tac agc cgc gtg gtg cac acc tac cgc ctc ccc agc 1746 
Asn Pro Leu Pro Tyr Ser Arg Val Val His Thr Tyr Arg Leu Pro Ser 
535 540 545 

tgt ggc tgc cac ccc aag cac tcc tgg ggc gcc ctg tgc cgc aag ctg 1794 
Cys Gly Cys His Pro Lys His Ser Trp Gly Ala Leu Cys Arg Lys Leu 
550 555 560 

ttc ctt ggg gag ctc atc tac ccc tgg agg cag aga ggg gac aag cag 1842 
Phe Leu Gly Glu Leu Ile Tyr Pro Trp Arg Gln Arg Gly Asp Lys Gln 
565 570 575 580 

gac tga gggaactgct ggctcgcctg gcaccaccac acggccacag ccagccatct 1898 
Asp * 

gctctgccag ggtggcacca gctcagctgg cgcatgtccc gtgctttgtg gacgctgctg 1958 
tgtgctcctg aacacggcag gccctgctat cacaccttgg gcttggaggt cattgggagt 2018 
gagcagatgt gggggtggcc agccaggctg gccgcactcc atcactggca ctgcctgcct 2078 
tgggacccgc ttcccacctg ctgcggtcac catggtggcg agcacagcaa ccccaggtgt 2138 
ccagagcact gccccatgcc caccctgcat acccaggtcc agagggtccg tccaccacag 2198 
cagccccagg tggagggctg gtctccctgg gggctcccca gtggctctgc cctggctgtg 2258 
ggggtggagg gaccttgcca ggatgaaccc tccagtccca ggcaccctct agctccctca 2318 
gccgaacagc accctgcatc tgggggattg aagcagtcgc tgacccccgt ccccagcggg 2378 
cccgggccct cactccctga accacacggg gtttatttgc ggatgttccc tggagaggtc 2438 
gctttgtgaa gaaaccatca gcaggctgtg agcatcgcca ggctgctgtg ggggcgggag 2498 
cagcctcagt gtcaagggcc tgcccactga cccagccgta cctattcgtc cacggtgccc 2558 
cgtagcagca ggtcctgcgg ccaaatctgt ctcccttcat gggcctccca gggaaggagg 2618 
aagccctgct gtgcagacac ctctgtggcc ccccaggggt gtgagcggcc tggggagggg 2678 
gccgtggcac tgaggccgaa agtgcctgcc agacggcacg gtctgggtgc gggtgttccc 2738 
tgtgagcccg agtccgcttc aggaggggag cctgcaggtg ccggctggtg aggggatgac 2798 
gcgctgtggg tgggaggagg cagcgcccat ctcagcagca ccaggactgc ctgggactcc 2858 
ctggcaaccc agcaccgggg aagccgtcag ctgctgtgac aataaaacct gccccgtgtc 2918 
tgg 2921 

<210> 7 

<211> 581 

<212> PRT 

<213> Homo Sapien 


<400> 7 


Met Val Leu Lys Ala Phe Phe Pro Thr Cys Cys Val Ser Ala Asp Ser 
1 5 10 15 

Gly Leu Leu Val Gly Arg Trp Val Pro Glu Gln Ser Ser Ala Val Val 
20 25 30 

Leu Ala Val Leu His Phe Pro Phe Ile Pro Ile Gln Val Lys Gln Leu 
35 40 45 

Leu Ala Gln Val Arg Gln Ala Ser Gln Val Gly Val Ala Val Leu Gly 
50 55 60 

Thr Trp Cys His Cys Arg Gln Glu Pro Glu Glu Ser Leu Gly Arg Phe 
65 70 75 80 

Leu Glu Ser Leu Gly Ala Val Phe Pro His Glu Pro Trp Leu Arg Leu 
85 90 95 

Cys Arg Glu Arg Gly Gly Thr Phe Trp Ser Cys Glu Ala Thr His Arg 
100 105 110 

Gln Ala Pro Thr Ala Pro Gly Ala Pro Gly Glu Asp Gln Val Met Leu 
115 120 125 

Ile Phe Tyr Asp Gln Arg Gln Val Leu Leu Ser Gln Leu His Leu Pro 
130 135 140 

Thr Val Leu Pro Asp Arg Gln Ala Gly Ala Thr Thr Ala Ser Thr Gly 
145 150 155 160 

Gly Leu Ala Ala Val Phe Asp Thr Val Ala Arg Ser Glu Val Leu Phe 
165 170 175 

Arg Ser Asp Arg Phe Asp Glu Gly Pro Val Arg Leu Ser His Trp Gln 
180 185 190 

Ser Glu Gly Val Glu Ala Ser Ile Leu Ala Glu Leu Ala Arg Arg Ala 
195 200 205 

Ser Gly Pro Ile Cys Leu Leu Leu Ala Ser Leu Leu Ser Leu Val Ser 
210 215 220 

Ala Val Ser Ala Cys Arg Val Phe Lys Leu Trp Pro Leu Ser Phe Leu 
225 230 235 240 

Gly Ser Lys Leu Ser Thr Cys Glu Gln Leu Arg His Arg Leu Glu His 
245 250 255 

Leu Thr Leu Ile Phe Ser Thr Arg Lys Ala Glu Asn Pro Ala Gln Leu 
260 265 270 

Met Arg Lys Ala Asn Thr Val Ala Ser Val Leu Leu Asp Val Ala Leu 
275 280 285 

Gly Leu Met Leu Leu Ser Trp Leu His Gly Arg Ser Arg Ile Gly His 
290 295 300 

Leu Ala Asp Ala Leu Val Pro Val Ala Asp His Val Ala Glu Glu Leu 
305 310 315 320 

Gln His Leu Leu Gln Trp Leu Met Gly Ala Pro Ala Gly Leu Lys Met 
325 330 335 

Asn Arg Ala Leu Asp Gln Val Leu Gly Arg Phe Phe Leu Tyr His Ile 
340 345 350 

His Leu Trp Ile Ser Tyr Ile His Leu Met Ser Pro Phe Val Glu His 
355 360 365 

Ile Leu Trp His Val Gly Leu Ser Ala Cys Leu Gly Leu Thr Val Ala 
370 375 380 

Leu Ser Leu Leu Ser Asp Ile Ile Ala Leu Leu Thr Phe His Ile Tyr 
385 390 395 400 

Cys Phe Tyr Val Tyr Gly Ala Arg Leu Tyr Cys Leu Lys Ile His Gly 
405 410 415 

Leu Ser Ser Leu Trp Arg Leu Phe Arg Gly Lys Lys Trp Asn Val Leu 
420 425 430 

Arg Gln Arg Val Asp Ser Cys Ser Tyr Asp Leu Asp Gln Leu Phe Ile 
435 440 445 

Gly Thr Leu Leu Phe Thr Ile Leu Leu Phe Leu Leu Pro Thr Thr Ala 
450 455 460 

Leu Tyr Tyr Leu Val Phe Thr Leu Leu Arg Leu Leu Val Val Ala Val 
465 470 475 480 

Gln Gly Leu Ile His Leu Leu Val Asp Leu Ile Asn Ser Leu Pro Leu 
485 490 495 

Tyr Ser Leu Gly Leu Arg Leu Cys Arg Pro Tyr Arg Leu Ala Ala Gly 
500 505 510 

Val Lys Phe Arg Val Leu Arg His Glu Ala Ser Arg Pro Leu Arg Leu 
515 520 525 

Leu Met Gln Ile Asn Pro Leu Pro Tyr Ser Arg Val Val His Thr Tyr 
530 535 540 

Arg Leu Pro Ser Cys Gly Cys His Pro Lys His Ser Trp Gly Ala Leu 
545 550 555 560 

Cys Arg Lys Leu Phe Leu Gly Glu Leu Ile Tyr Pro Trp Arg Gln Arg 
565 570 575 

Gly Asp Lys Gln Asp 
580 

<210> 8 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 8 

agcagggctt cctccttc 18 

<210> 9 

<211> 43 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 

<400> 9 

agcggataac aatttcacac aggtgaccca gccgtaccta ttc 43 

<210> 10 
<211> 18 
<212> DNA 

<213> Artificial Sequence 


<220> 
<223> MassExtend primer 
<400> 10 

aagggagaca gatttggc 18 
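
The MassExtend primer of SEQ ID NO: 10 does not occur in SEQ ID NO: 6 as written, but its reverse complement does, which suggests that it anneals to the strand shown and is extended toward lower coordinates. The sketch below illustrates this on a 60-nucleotide window of SEQ ID NO: 6 (positions 2559 to 2618 as read from the listing, with OCR-damaged characters restored). The single-base extension model and the names `WINDOW`, `WINDOW_START`, `reverse_complement` and `upstream_site` are assumptions of the example.

```python
# Positions 2559 to 2618 of SEQ ID NO: 6 as transcribed from the listing
# (an assumption of this sketch).
WINDOW_START = 2559
WINDOW = "cgtagcagcaggtcctgcggccaaatctgtctcccttcatgggcctcccagggaaggagg"
PRIMER = "aagggagacagatttggc"  # SEQ ID NO: 10

COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def upstream_site(window: str, window_start: int, primer: str):
    """Locate a reverse-strand primer and return the listed-strand position
    and base immediately adjacent to the primer's 3' end."""
    index = window.find(reverse_complement(primer))
    if index < 0:
        raise ValueError("primer binding site not found in window")
    if index == 0:
        raise ValueError("adjacent base lies outside the window")
    return window_start + index - 1, window[index - 1]


position, base = upstream_site(WINDOW, WINDOW_START, PRIMER)
print(position, base)
```

Under these assumptions the primer covers positions 2578 to 2595 and the interrogated base is the g at position 2577 of SEQ ID NO: 6, the position recited in the claims for the G to A variant of the GPI-1 gene.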

<210> 11 
<211> 1790 
<212> DNA 
<213> Homo sapien 

<220> 
<221> CDS 

<222> (131) . . . (1612) 

<223> Nucleotide sequence encoding Cholesterol ester 
transfer protein (CETP) 

<400> 11 

gtgaatctct ggggccagga agaccctgct gcccggaaga gcctcatgtt ccgtgggggc 60 
tgggcggaca tacatatacg ggctccaggc tgaacggctc gggccactta cacaccactg 120 
cctgataacc atg ctg gct gcc aca gtc ctg acc ctg gcc ctg ctg ggc 169 
Met Leu Ala Ala Thr Val Leu Thr Leu Ala Leu Leu Gly 
1 5 10 

aat gcc cat gcc tgc tcc aaa ggc acc tcg cac gag gca ggc atc gtg 217 
Asn Ala His Ala Cys Ser Lys Gly Thr Ser His Glu Ala Gly Ile Val 
15 20 25 

tgc cgc atc acc aag cct gcc ctc ctg gtg ttg aac cac gag act gcc 265 
Cys Arg Ile Thr Lys Pro Ala Leu Leu Val Leu Asn His Glu Thr Ala 
30 35 40 45 

aag gtg atc cag acc gcc ttc cag cga gcc agc tac cca gat atc acg 313 
Lys Val Ile Gln Thr Ala Phe Gln Arg Ala Ser Tyr Pro Asp Ile Thr 
50 55 60 

ggc gag aag gcc atg atg ctc ctt ggc caa gtc aag tat ggg ttg cac 361 
Gly Glu Lys Ala Met Met Leu Leu Gly Gln Val Lys Tyr Gly Leu His 
65 70 75 

aac atc cag atc agc cac ttg tcc atc gcc agc agc cag gtg gag ctg 409 
Asn Ile Gln Ile Ser His Leu Ser Ile Ala Ser Ser Gln Val Glu Leu 
80 85 90 

gtg gaa gcc aag tcc att gat gtc tcc att cag aac gtg tct gtg gtc 457 
Val Glu Ala Lys Ser Ile Asp Val Ser Ile Gln Asn Val Ser Val Val 
95 100 105 

ttc aag ggg acc ctg aag tat ggc tac acc act gcc tgg tgg ctg ggt 505 
Phe Lys Gly Thr Leu Lys Tyr Gly Tyr Thr Thr Ala Trp Trp Leu Gly 
110 115 120 125 

att gat cag tcc att gac ttc gag atc gac tct gcc att gac ctc cag 553 
Ile Asp Gln Ser Ile Asp Phe Glu Ile Asp Ser Ala Ile Asp Leu Gln 
130 135 140 

atc aac aca cag ctg acc tgt gac tct ggt aga gtg cgg acc gat gcc 601 
Ile Asn Thr Gln Leu Thr Cys Asp Ser Gly Arg Val Arg Thr Asp Ala 
145 150 155 

cct gac tgc tac ctg tct ttc cat aag ctg ctc ctg cat ctc caa ggg 649 
Pro Asp Cys Tyr Leu Ser Phe His Lys Leu Leu Leu His Leu Gln Gly 
160 165 170 

gag cga gag cct ggg tgg atc aag cag ctg ttc aca aat ttc atc tcc 697 
Glu Arg Glu Pro Gly Trp Ile Lys Gln Leu Phe Thr Asn Phe Ile Ser 
175 180 185 

ttc acc ctg aag ctg gtc ctg aag gga cag atc tgc aaa gag atc aac 745 
Phe Thr Leu Lys Leu Val Leu Lys Gly Gln Ile Cys Lys Glu Ile Asn 
190 195 200 205 

gtc atc tct aac atc atg gcc gat ttt gtc cag aca agg gct gcc agc 793 
Val Ile Ser Asn Ile Met Ala Asp Phe Val Gln Thr Arg Ala Ala Ser 
210 215 220 

atc ctt tca gat gga gac att ggg gtg gac att tcc ctg aca ggt gat 841 
Ile Leu Ser Asp Gly Asp Ile Gly Val Asp Ile Ser Leu Thr Gly Asp 
225 230 235 

ccc gtc atc aca gcc tcc tac ctg gag tcc cat cac aag ggt cat ttc 889 
Pro Val Ile Thr Ala Ser Tyr Leu Glu Ser His His Lys Gly His Phe 
240 245 250 

atc tac aag aat gtc tca gag gac ctc ccc ctc ccc acc ttc tcg ccc 937 
Ile Tyr Lys Asn Val Ser Glu Asp Leu Pro Leu Pro Thr Phe Ser Pro 
255 260 265 

aca ctg ctg ggg gac tcc cgc atg ctg tac ttc tgg ttc tct gag cga 985 
Thr Leu Leu Gly Asp Ser Arg Met Leu Tyr Phe Trp Phe Ser Glu Arg 
270 275 280 285 

gtc ttc cac tcg ctg gcc aag gta gct ttc cag gat ggc cgc ctc atg 1033 
Val Phe His Ser Leu Ala Lys Val Ala Phe Gln Asp Gly Arg Leu Met 
290 295 300 

ctc agc ctg atg gga gac gag ttc aag gca gtg ctg gag acc tgg ggc 1081 
Leu Ser Leu Met Gly Asp Glu Phe Lys Ala Val Leu Glu Thr Trp Gly 
305 310 315 

ttc aac acc aac cag gaa atc ttc caa gag gtt gtc ggc ggc ttc ccc 1129 
Phe Asn Thr Asn Gln Glu Ile Phe Gln Glu Val Val Gly Gly Phe Pro 
320 325 330 

agc cag gcc caa gtc acc gtc cac tgc ctc aag atg ccc aag atc tcc 1177 
Ser Gln Ala Gln Val Thr Val His Cys Leu Lys Met Pro Lys Ile Ser 
335 340 345 

tgc caa aac aag gga gtc gtg gtc aat tct tca gtg atg gtg aaa ttc 1225 
Cys Gln Asn Lys Gly Val Val Val Asn Ser Ser Val Met Val Lys Phe 
350 355 360 365 

ctc ttt cca cgc cca gac cag caa cat tct gta gct tac aca ttt gaa 1273 
Leu Phe Pro Arg Pro Asp Gln Gln His Ser Val Ala Tyr Thr Phe Glu 
370 375 380 

gag gat atc gtg act acc gtc cag gcc tcc tat tct aag aaa aag ctc 1321 
Glu Asp Ile Val Thr Thr Val Gln Ala Ser Tyr Ser Lys Lys Lys Leu 
385 390 395 

ttc tta agc ctc ttg gat ttc cag att aca cca aag act gtt tcc aac 1369 
Phe Leu Ser Leu Leu Asp Phe Gln Ile Thr Pro Lys Thr Val Ser Asn 
400 405 410 

ttg act gag agc agc tcc gag tcc atc cag agc ttc ctg cag tca atg 1417 
Leu Thr Glu Ser Ser Ser Glu Ser Ile Gln Ser Phe Leu Gln Ser Met 
415 420 425 

atc acc gct gtg ggc atc cct gag gtc atg tct cgg ctc gag gta gtg 1465 
Ile Thr Ala Val Gly Ile Pro Glu Val Met Ser Arg Leu Glu Val Val 
430 435 440 445 

ttt aca gcc ctc atg aac agc aaa ggc gtg agc ctc ttc gac atc atc 1513 
Phe Thr Ala Leu Met Asn Ser Lys Gly Val Ser Leu Phe Asp Ile Ile 
450 455 460 

aac cct gag att atc act cga gat ggc ttc ctg ctg ctg cag atg gac 1561 
Asn Pro Glu Ile Ile Thr Arg Asp Gly Phe Leu Leu Leu Gln Met Asp 
465 470 475 

ttt ggc ttc cct gag cac ctg ctg gtg gat ttc ctc cag agc ttg agc 1609 
Phe Gly Phe Pro Glu His Leu Leu Val Asp Phe Leu Gln Ser Leu Ser 
480 485 490 

tag aagtctccaa ggaggtcggg atggggcttg tagcagaagg caagcaccag 1662 
gctcacagct ggaaccctgg tgtctcctcc agcgtggtgg aagttgggtt aggagtacgg 1722 
agatggagat tggctcccaa ctcctcccta tcctaaaggc ccactggcat taaagtgctg 1782 
tatccaag 1790 

<210> 12 

<211> 493 

<212> PRT 

<213> Homo sapien 


<400> 12 










Met Leu Ala Ala Thr Val Leu Thr Leu Ala Leu Leu Gly Asn Ala His 
1 5 10 15 

Ala Cys Ser Lys Gly Thr Ser His Glu Ala Gly Ile Val Cys Arg Ile 
20 25 30 

Thr Lys Pro Ala Leu Leu Val Leu Asn His Glu Thr Ala Lys Val Ile 
35 40 45 

Gln Thr Ala Phe Gln Arg Ala Ser Tyr Pro Asp Ile Thr Gly Glu Lys 
50 55 60 

Ala Met Met Leu Leu Gly Gln Val Lys Tyr Gly Leu His Asn Ile Gln 
65 70 75 80 

Ile Ser His Leu Ser Ile Ala Ser Ser Gln Val Glu Leu Val Glu Ala 
85 90 95 

Lys Ser Ile Asp Val Ser Ile Gln Asn Val Ser Val Val Phe Lys Gly 
100 105 110 

Thr Leu Lys Tyr Gly Tyr Thr Thr Ala Trp Trp Leu Gly Ile Asp Gln 
115 120 125 

Ser Ile Asp Phe Glu Ile Asp Ser Ala Ile Asp Leu Gln Ile Asn Thr 
130 135 140 

Gln Leu Thr Cys Asp Ser Gly Arg Val Arg Thr Asp Ala Pro Asp Cys 
145 150 155 160 

Tyr Leu Ser Phe His Lys Leu Leu Leu His Leu Gln Gly Glu Arg Glu 
165 170 175 

Pro Gly Trp Ile Lys Gln Leu Phe Thr Asn Phe Ile Ser Phe Thr Leu 
180 185 190 

Lys Leu Val Leu Lys Gly Gln Ile Cys Lys Glu Ile Asn Val Ile Ser 
195 200 205 

Asn Ile Met Ala Asp Phe Val Gln Thr Arg Ala Ala Ser Ile Leu Ser 
210 215 220 

Asp Gly Asp Ile Gly Val Asp Ile Ser Leu Thr Gly Asp Pro Val Ile 
225 230 235 240 

Thr Ala Ser Tyr Leu Glu Ser His His Lys Gly His Phe Ile Tyr Lys 
245 250 255 

Asn Val Ser Glu Asp Leu Pro Leu Pro Thr Phe Ser Pro Thr Leu Leu 
260 265 270 

Gly Asp Ser Arg Met Leu Tyr Phe Trp Phe Ser Glu Arg Val Phe His 
275 280 285 

Ser Leu Ala Lys Val Ala Phe Gln Asp Gly Arg Leu Met Leu Ser Leu 
290 295 300 

Met Gly Asp Glu Phe Lys Ala Val Leu Glu Thr Trp Gly Phe Asn Thr 
305 310 315 320 

Asn Gln Glu Ile Phe Gln Glu Val Val Gly Gly Phe Pro Ser Gln Ala 
325 330 335 

Gln Val Thr Val His Cys Leu Lys Met Pro Lys Ile Ser Cys Gln Asn 
340 345 350 

Lys Gly Val Val Val Asn Ser Ser Val Met Val Lys Phe Leu Phe Pro 
355 360 365 

Arg Pro Asp Gln Gln His Ser Val Ala Tyr Thr Phe Glu Glu Asp Ile 
370 375 380 

Val Thr Thr Val Gln Ala Ser Tyr Ser Lys Lys Lys Leu Phe Leu Ser 
385 390 395 400 

Leu Leu Asp Phe Gln Ile Thr Pro Lys Thr Val Ser Asn Leu Thr Glu 
405 410 415 

Ser Ser Ser Glu Ser Ile Gln Ser Phe Leu Gln Ser Met Ile Thr Ala 
420 425 430 

Val Gly Ile Pro Glu Val Met Ser Arg Leu Glu Val Val Phe Thr Ala 
435 440 445 

Leu Met Asn Ser Lys Gly Val Ser Leu Phe Asp Ile Ile Asn Pro Glu 
450 455 460 

Ile Ile Thr Arg Asp Gly Phe Leu Leu Leu Gln Met Asp Phe Gly Phe 
465 470 475 480 

Pro Glu His Leu Leu Val Asp Phe Leu Gln Ser Leu Ser 
485 490 

<210> 13 
<211> 3549 
<212> DNA 
<213> Homo sapien 

<220> 
<221> CDS 

<222> (175) . . . (1602) 

<223> Nucleotide sequence encoding lipoprotein lipase 
(LPL) 


<400> 13 
cccctcttcc tcctcctcaa gggaaagctg cccacttcta gctgccctgc catccccttt 60 
aaagggcgac ttgctcagcg ccaaaccgcg gctccagccc tctccagcct ccggctcagc 120 
cggctcatca gtcggtccgc gccttgcagc tcctccagag ggacgcgccc cgag atg 177 
Met 
1 

gag agc aaa gcc ctg ctc gtg ctg act ctg gcc gtg tgg ctc cag agt 225 
Glu Ser Lys Ala Leu Leu Val Leu Thr Leu Ala Val Trp Leu Gln Ser 
5 10 15 

ctg acc gcc tcc cgc gga ggg gtg gcc gcc gcc gac caa aga aga gat 273 
Leu Thr Ala Ser Arg Gly Gly Val Ala Ala Ala Asp Gln Arg Arg Asp 
20 25 30 

ttt atc gac atc gaa agt aaa ttt gcc cta agg acc cct gaa gac aca 321 
Phe Ile Asp Ile Glu Ser Lys Phe Ala Leu Arg Thr Pro Glu Asp Thr 
35 40 45 

gct gag gac act tgc cac ctc att ccc gga gta gca gag tcc gtg gct 369 
Ala Glu Asp Thr Cys His Leu Ile Pro Gly Val Ala Glu Ser Val Ala 
50 55 60 65 

acc tgt cat ttc aat cac agc agc aaa acc ttc atg gtg atc cat ggc 417 
Thr Cys His Phe Asn His Ser Ser Lys Thr Phe Met Val Ile His Gly 
70 75 80 

tgg acg gta aca gga atg tat gag agt tgg gtg cca aaa ctt gtg gcc 465 
Trp Thr Val Thr Gly Met Tyr Glu Ser Trp Val Pro Lys Leu Val Ala 
85 90 95 

gcc ctg tac aag aga gaa cca gac tcc aat gtc att gtg gtg gac tgg 513 
Ala Leu Tyr Lys Arg Glu Pro Asp Ser Asn Val Ile Val Val Asp Trp 
100 105 110 

ctg tca cgg gct cag gag cat tac cca gtg tcc gcg ggc tac acc aaa 561 
Leu Ser Arg Ala Gln Glu His Tyr Pro Val Ser Ala Gly Tyr Thr Lys 
115 120 125 

ctg gtg gga cag gat gtg gcc cgg ttt atc aac tgg atg gag gag gag 609 
Leu Val Gly Gln Asp Val Ala Arg Phe Ile Asn Trp Met Glu Glu Glu 
130 135 140 145 

ttt aac tac cct ctg gac aat gtc cat ctc ttg gga tac agc ctt gga 657 
Phe Asn Tyr Pro Leu Asp Asn Val His Leu Leu Gly Tyr Ser Leu Gly 
150 155 160 

gcc cat gct gct ggc att gca gga agt ctg acc aat aag aaa gtc aac 705 
Ala His Ala Ala Gly Ile Ala Gly Ser Leu Thr Asn Lys Lys Val Asn 
165 170 175 

aga att act ggc ctc gat cca gct gga cct aac ttt gag tat gca gaa 753 
Arg Ile Thr Gly Leu Asp Pro Ala Gly Pro Asn Phe Glu Tyr Ala Glu 
180 185 190 

gcc ccg agt cgt ctt tct cct gat gat gca gat ttt gta gac gtc tta 801 
Ala Pro Ser Arg Leu Ser Pro Asp Asp Ala Asp Phe Val Asp Val Leu 
195 200 205 

cac aca ttc acc aga ggg tcc cct ggt cga agc att gga atc cag aaa 849 
His Thr Phe Thr Arg Gly Ser Pro Gly Arg Ser Ile Gly Ile Gln Lys 
210 215 220 225 

cca gtt ggg cat gtt gac att tac ccg aat gga ggt act ttt cag cca 897 
Pro Val Gly His Val Asp Ile Tyr Pro Asn Gly Gly Thr Phe Gln Pro 
230 235 240 

gga tgt aac att gga gaa gct atc cgc gtg att gca gag aga gga ctt 945 
Gly Cys Asn Ile Gly Glu Ala Ile Arg Val Ile Ala Glu Arg Gly Leu 
245 250 255 

gga gat gtg gac cag cta gtg aag tgc tcc cac gag cgc tcc att cat 993 
Gly Asp Val Asp Gln Leu Val Lys Cys Ser His Glu Arg Ser Ile His 
260 265 270 

ctc ttc atc gac tct ctg ttg aat gaa gaa aat cca agt aag gcc tac 1041 
Leu Phe Ile Asp Ser Leu Leu Asn Glu Glu Asn Pro Ser Lys Ala Tyr 
275 280 285 

agg tgc agt tcc aag gaa gcc ttt gag aaa ggg ctc tgc ttg agt tgt 1089 
Arg Cys Ser Ser Lys Glu Ala Phe Glu Lys Gly Leu Cys Leu Ser Cys 
290 295 300 305 

aga aag aac cgc tgc aac aat ctg ggc tat gag atc aat aaa gtc aga 1137 
Arg Lys Asn Arg Cys Asn Asn Leu Gly Tyr Glu Ile Asn Lys Val Arg 
310 315 320 

gcc aaa aga agc agc aaa atg tac ctg aag act cgt tct cag atg ccc 1185 
Ala Lys Arg Ser Ser Lys Met Tyr Leu Lys Thr Arg Ser Gln Met Pro 
325 330 335 

tac aaa gtc ttc cat tac caa gta aag att cat ttt tct ggg act gag 1233 
Tyr Lys Val Phe His Tyr Gln Val Lys Ile His Phe Ser Gly Thr Glu 
340 345 350 

agt gaa acc cat acc aat cag gcc ttt gag att tct ctg tat ggc acc 1281 
Ser Glu Thr His Thr Asn Gln Ala Phe Glu Ile Ser Leu Tyr Gly Thr 
355 360 365 

gtg gcc gag agt gag aac atc cca ttc act ctg cct gaa gtt tcc aca 1329 
Val Ala Glu Ser Glu Asn Ile Pro Phe Thr Leu Pro Glu Val Ser Thr 
370 375 380 385 

aat aag acc tac tcc ttc cta att tac aca gag gta gat att gga gaa 1377 
Asn Lys Thr Tyr Ser Phe Leu Ile Tyr Thr Glu Val Asp Ile Gly Glu 
390 395 400 

cta ctc atg ttg aag ctc aaa tgg aag agt gat tca tac ttt agc tgg 1425 
Leu Leu Met Leu Lys Leu Lys Trp Lys Ser Asp Ser Tyr Phe Ser Trp 
405 410 415 

tca gac tgg tgg agc agt ccc ggc ttc gcc att cag aag atc aga gta 1473 
Ser Asp Trp Trp Ser Ser Pro Gly Phe Ala Ile Gln Lys Ile Arg Val 
420 425 430 

aaa gca gga gag act cag aaa aag gtg atc ttc tgt tct agg gag aaa 1521 
Lys Ala Gly Glu Thr Gln Lys Lys Val Ile Phe Cys Ser Arg Glu Lys 
435 440 445 

gtg tct cat ttg cag aaa gga aag gca cct gcg gta ttt gtg aaa tgc 1569 
Val Ser His Leu Gln Lys Gly Lys Ala Pro Ala Val Phe Val Lys Cys 
450 455 460 465 

cat gac aag tct ctg aat aag aag tca ggc tga aactgggcga atctacagaa 1622 
His Asp Lys Ser Leu Asn Lys Lys Ser Gly * 
470 475 

caaagaacgg catgtgaatt ctgtgaagaa tgaagtggag gaagtaactt ttacaaaaca 1682 

tacccagtgt ttggggtgtt tcaaaagtgg attttcctga atattaatcc cagccctacc 1742 

cttgttagtt attttaggag acagtctcaa gcactaaaaa gtggctaatt caatttatgg 1802 

ggtatagtgg ccaaatagca catcctccaa cgttaaaaga cagtggatca tgaaaagtgc 1862 

tgttttgtcc tttgagaaag aaataattgt ttgagcgcag agtaaaataa ggctccttca 1922 

tgtggcgtat tgggccatag cctataattg gttagaacct cctattttaa ttggaattct 1982 

ggatctttcg gactgaggcc ttctcaaact ttactctaag tctccaagaa tacagaaaat 2042 

gcttttccgc ggcacgaatc agactcatct acacagcagt atgaatgatg ttttagaatg 2102 

attccctctt gctattggaa tgtggtccag acgtcaacca ggaacatgta acttggagag 2162 

ggacgaagaa agggtctgat aaacacagag gttttaaaca gtccctacca ttggcctgca 2222 

tcatgacaaa gttacaaatt caaggagata taaaatctag atcaattaat tcttaatagg 2282 

ctttatcgtt tattgcttaa tccctctctc ccccttcttt tttgtctcaa gattatatta 2342 

taataatgtt ctctgggtag gtgttgaaaa tgagcctgta atcctcagct gacacataat 2402 

ttgaatggtg cagaaaaaaa aaagataccg taattttatt attagattct ccaaatgatt 2462 

ttcatcaatt taaaatcatt caatatctga cagttactct tcagttttag gcttaccttg 2522 

gtcatgcttc agttgtactt ccagtgcgtc tcttttgttc ctggctttga catgaaaaga 2582 

taggtttgag ttcaaatttt gcattgtgtg agcttctaca gattttagac aaggaccgtt 2642 

tttactaagt aaaagggtgg agaggttcct ggggtggatt cctaagcagt gcttgtaaac 2702 

catcgcgtgc aatgagccag atggagtacc atgagggttg ttatttgttg tttttaacaa 2762 

ctaatcaaga gtgagtgaac aactatttat aaactagatc tcctattttt cagaatgctc 2822 

ttctacgtat aaatatgaaa tgataaagat gtcaaatatc tcagaggcta tagctgggaa 2882 

cccgactgtg aaagtatgtg atatctgaac acatactaga aagctctgca tgtgtgttgt 2942 

ccttcagcat aattcggaag ggaaaacagt cgatcaaggg atgtattgga acatgtcgga 3002 

gtagaaattg ttcctgatgt gccagaactt cgaccctttc tctgagagag atgatcgtgc 3062 

ctataaatag taggaccaat gttgtgatta acatcatcag gcttggaatg aattctctct 3122 

aaaaataaaa tgatgtatga tttgttgttg gcatcccctt tattaattca ttaaatttct 3182 

ggatttgggt tgtgacccag ggtgcattaa cttaaaagat tcactaaagc agcacatagc 3242 

actgggaact ctggctccga aaaactttgt tatatatatc aaggatgttc tggctttaca 3302 

ttttatttat tagctgtaaa tacatgtgtg gatgtgtaaa tggagcttgt acatattgga 3362 

aaggtcattg tggctatctg catttataaa tgtgtggtgc taactgtatg tgtctttatc 3422 

agtgatggtc tcacagagcc aactcactct tatgaaatgg gctttaacaa aacaagaaag 3482 

aaacgtactt aactgtgtga agaaatggaa tcagctttta ataaaattga caacatttta 3542 

ttaccac 3549 

<210> 14 

<211> 475 

<212> PRT 

<213> Homo sapien 

<400> 14 

Met Glu Ser Lys Ala Leu Leu Val Leu Thr Leu Ala Val Trp Leu Gln 
1 5 10 15 

Ser Leu Thr Ala Ser Arg Gly Gly Val Ala Ala Ala Asp Gln Arg Arg 
20 25 30 

Asp Phe Ile Asp Ile Glu Ser Lys Phe Ala Leu Arg Thr Pro Glu Asp 
35 40 45 

Thr Ala Glu Asp Thr Cys His Leu Ile Pro Gly Val Ala Glu Ser Val 
50 55 60 

Ala Thr Cys His Phe Asn His Ser Ser Lys Thr Phe Met Val Ile His 
65 70 75 80 

Gly Trp Thr Val Thr Gly Met Tyr Glu Ser Trp Val Pro Lys Leu Val 
85 90 95 

Ala Ala Leu Tyr Lys Arg Glu Pro Asp Ser Asn Val Ile Val Val Asp 
100 105 110 

Trp Leu Ser Arg Ala Gln Glu His Tyr Pro Val Ser Ala Gly Tyr Thr 
115 120 125 

Lys Leu Val Gly Gln Asp Val Ala Arg Phe Ile Asn Trp Met Glu Glu 
130 135 140 

Glu Phe Asn Tyr Pro Leu Asp Asn Val His Leu Leu Gly Tyr Ser Leu 
145 150 155 160 

Gly Ala His Ala Ala Gly Ile Ala Gly Ser Leu Thr Asn Lys Lys Val 
165 170 175 

Asn Arg Ile Thr Gly Leu Asp Pro Ala Gly Pro Asn Phe Glu Tyr Ala 
180 185 190 

Glu Ala Pro Ser Arg Leu Ser Pro Asp Asp Ala Asp Phe Val Asp Val 
195 200 205 

Leu His Thr Phe Thr Arg Gly Ser Pro Gly Arg Ser Ile Gly Ile Gln 
210 215 220 

Lys Pro Val Gly His Val Asp Ile Tyr Pro Asn Gly Gly Thr Phe Gln 
225 230 235 240 

Pro Gly Cys Asn Ile Gly Glu Ala Ile Arg Val Ile Ala Glu Arg Gly 
245 250 255 

Leu Gly Asp Val Asp Gln Leu Val Lys Cys Ser His Glu Arg Ser Ile 
260 265 270 

His Leu Phe Ile Asp Ser Leu Leu Asn Glu Glu Asn Pro Ser Lys Ala 
275 280 285 

Tyr Arg Cys Ser Ser Lys Glu Ala Phe Glu Lys Gly Leu Cys Leu Ser 
290 295 300 

Cys Arg Lys Asn Arg Cys Asn Asn Leu Gly Tyr Glu Ile Asn Lys Val 
305 310 315 320 

Arg Ala Lys Arg Ser Ser Lys Met Tyr Leu Lys Thr Arg Ser Gln Met 
325 330 335 

Pro Tyr Lys Val Phe His Tyr Gln Val Lys Ile His Phe Ser Gly Thr 
340 345 350 

Glu Ser Glu Thr His Thr Asn Gln Ala Phe Glu Ile Ser Leu Tyr Gly 
355 360 365 

Thr Val Ala Glu Ser Glu Asn Ile Pro Phe Thr Leu Pro Glu Val Ser 
370 375 380 

Thr Asn Lys Thr Tyr Ser Phe Leu Ile Tyr Thr Glu Val Asp Ile Gly 
385 390 395 400 

Glu Leu Leu Met Leu Lys Leu Lys Trp Lys Ser Asp Ser Tyr Phe Ser 
405 410 415 

Trp Ser Asp Trp Trp Ser Ser Pro Gly Phe Ala Ile Gln Lys Ile Arg 
420 425 430 

Val Lys Ala Gly Glu Thr Gln Lys Lys Val Ile Phe Cys Ser Arg Glu 
435 440 445 

Lys Val Ser His Leu Gln Lys Gly Lys Ala Pro Ala Val Phe Val Lys 
450 455 460 

Cys His Asp Lys Ser Leu Asn Lys Lys Ser Gly 
465 470 475 


<210> 15 
<211> 1466 
<212> DNA 
<213> Homo sapien 
<220> 
<221> CDS 

<222> (115) . . . (1305) 

<223> Nucleotide sequence encoding apolipoprotein A-IV (APOA4) 

<400> 15 

agttcccact gcagcgcagg tgagctctcc tgaggacctc tctgtcagct cccctgattg 60 
tagggaggca tccagtgtgg caagaaactc ctccagccca gcaagcagct cagg atg 117 

Met 
1 

ttc ctg aag gcc gtg gtc ctg acc ctg gcc ctg gtg gct gtc gcc gga 165
Phe Leu Lys Ala Val Val Leu Thr Leu Ala Leu Val Ala Val Ala Gly
5 10 15

gcc agg gct gag gtc agt gct gac cag gtg gcc aca gtg atg tgg gac 213
Ala Arg Ala Glu Val Ser Ala Asp Gln Val Ala Thr Val Met Trp Asp
20 25 30

tac ttc agc cag ctg agc aac aat gcc aag gag gcc gtg gaa cat ctc 261
Tyr Phe Ser Gln Leu Ser Asn Asn Ala Lys Glu Ala Val Glu His Leu
35 40 45

cag aaa tct gaa ctc acc cag caa ctc aat gcc ctc ttc cag gac aaa 309
Gln Lys Ser Glu Leu Thr Gln Gln Leu Asn Ala Leu Phe Gln Asp Lys
50 55 60 65

ctt gga gaa gtg aac act tac gca ggt gac ctg cag aag aag ctg gtg 357
Leu Gly Glu Val Asn Thr Tyr Ala Gly Asp Leu Gln Lys Lys Leu Val
70 75 80

ccc ttt gcc acc gag ctg cat gaa cgc ctg gcc aag gac tcg gag aaa 405
Pro Phe Ala Thr Glu Leu His Glu Arg Leu Ala Lys Asp Ser Glu Lys
85 90 95

ctg aag gag gag att ggg aag gag ctg gag gag ctg agg gcc cgg ctg 453
Leu Lys Glu Glu Ile Gly Lys Glu Leu Glu Glu Leu Arg Ala Arg Leu
100 105 110

ctg ccc cat gcc aat gag gtg agc cag aag atc ggg gac aac ctg cga 501
Leu Pro His Ala Asn Glu Val Ser Gln Lys Ile Gly Asp Asn Leu Arg
115 120 125

gag ctt cag cag cgc ctg gag ccc tac gcg gac cag ctg cgc acc cag 549
Glu Leu Gln Gln Arg Leu Glu Pro Tyr Ala Asp Gln Leu Arg Thr Gln
130 135 140 145

gtc aac acg cag gcc gag cag ctg cgg cgc cag ctg acc ccc tac gca 597
Val Asn Thr Gln Ala Glu Gln Leu Arg Arg Gln Leu Thr Pro Tyr Ala
150 155 160

cag cgc atg gag aga gtg ctg cgg gag aac gcc gac agc ctg cag gcc 645
Gln Arg Met Glu Arg Val Leu Arg Glu Asn Ala Asp Ser Leu Gln Ala
165 170 175

tcg ctg agg ccc cac gcc gac gag ctc aag gcc aag atc gac cag aac 693
Ser Leu Arg Pro His Ala Asp Glu Leu Lys Ala Lys Ile Asp Gln Asn
180 185 190

gtg gag gag ctc aag gga cgc ctt acg ccc tac gct gac gaa ttc aaa 741
Val Glu Glu Leu Lys Gly Arg Leu Thr Pro Tyr Ala Asp Glu Phe Lys
195 200 205

gtc aag att gac cag acc gtg gag gag ctg cgc cgc agc ctg gct ccc 789
Val Lys Ile Asp Gln Thr Val Glu Glu Leu Arg Arg Ser Leu Ala Pro
210 215 220 225

tat gct cag gac acg cag gag aag ctc aac cac cag ctt gag ggc ctg 837
Tyr Ala Gln Asp Thr Gln Glu Lys Leu Asn His Gln Leu Glu Gly Leu
230 235 240

acc ttc cag atg aag aag aac gcc gag gag ctc aag gcc agg atc tcg 885
Thr Phe Gln Met Lys Lys Asn Ala Glu Glu Leu Lys Ala Arg Ile Ser
245 250 255

gcc agt gcc gag gag ctg cgg cag agg ctg gcg ccc ttg gcc gag gac 933
Ala Ser Ala Glu Glu Leu Arg Gln Arg Leu Ala Pro Leu Ala Glu Asp
260 265 270

gtg cgt ggc aac ctg agg ggc aac acc gag ggg ctg cag aag tca ctg 981
Val Arg Gly Asn Leu Arg Gly Asn Thr Glu Gly Leu Gln Lys Ser Leu
275 280 285

gca gag ctg ggt ggg cac ctg gac cag cag gtg gag gag ttc cga cgc 1029
Ala Glu Leu Gly Gly His Leu Asp Gln Gln Val Glu Glu Phe Arg Arg
290 295 300 305

cgg gtg gag ccc tac ggg gaa aac ttc aac aaa gcc ctg gtg cag cag 1077
Arg Val Glu Pro Tyr Gly Glu Asn Phe Asn Lys Ala Leu Val Gln Gln
310 315 320

atg gaa cag ctc agg acg aaa ctg ggc ccc cat gcg ggg gac gtg gaa 1125
Met Glu Gln Leu Arg Thr Lys Leu Gly Pro His Ala Gly Asp Val Glu
325 330 335

ggc cac ttg agc ttc ctg gag aag gac ctg agg gac aag gtc aac tcc 1173
Gly His Leu Ser Phe Leu Glu Lys Asp Leu Arg Asp Lys Val Asn Ser
340 345 350

ttc ttc agc acc ttc aag gag aaa gag agc cag gac aag act ctc tcc 1221
Phe Phe Ser Thr Phe Lys Glu Lys Glu Ser Gln Asp Lys Thr Leu Ser
355 360 365

ctc cct gag ctg gag caa cag cag gaa cag cat cag gag cag cag cag 1269
Leu Pro Glu Leu Glu Gln Gln Gln Glu Gln His Gln Glu Gln Gln Gln
370 375 380 385

gag cag gtg cag atg ctg gcc cct ttg gag agc tga gctgcccctg 1315
Glu Gln Val Gln Met Leu Ala Pro Leu Glu Ser *
390 395

gtgcactggc cccaccctcg tggacacctg ccctgccctg ccacctgtct gtctgtccca 1375 
aagaagttct ggtatgaact tgaggacaca tgtccagtgg gaggtgagac cacctctcaa 1435 
tattcaataa agctgctgag aatctagcct c 1466 

<210> 16 

<211> 396 

<212> PRT 

<213> Homo sapien 

<400> 16 

Met Phe Leu Lys Ala Val Val Leu Thr Leu Ala Leu Val Ala Val Ala
1 5 10 15

Gly Ala Arg Ala Glu Val Ser Ala Asp Gln Val Ala Thr Val Met Trp
20 25 30

Asp Tyr Phe Ser Gln Leu Ser Asn Asn Ala Lys Glu Ala Val Glu His
35 40 45

Leu Gln Lys Ser Glu Leu Thr Gln Gln Leu Asn Ala Leu Phe Gln Asp
50 55 60

Lys Leu Gly Glu Val Asn Thr Tyr Ala Gly Asp Leu Gln Lys Lys Leu
65 70 75 80

Val Pro Phe Ala Thr Glu Leu His Glu Arg Leu Ala Lys Asp Ser Glu
85 90 95

Lys Leu Lys Glu Glu Ile Gly Lys Glu Leu Glu Glu Leu Arg Ala Arg
100 105 110

Leu Leu Pro His Ala Asn Glu Val Ser Gln Lys Ile Gly Asp Asn Leu
115 120 125

Arg Glu Leu Gln Gln Arg Leu Glu Pro Tyr Ala Asp Gln Leu Arg Thr
130 135 140

Gln Val Asn Thr Gln Ala Glu Gln Leu Arg Arg Gln Leu Thr Pro Tyr
145 150 155 160

Ala Gln Arg Met Glu Arg Val Leu Arg Glu Asn Ala Asp Ser Leu Gln
165 170 175

Ala Ser Leu Arg Pro His Ala Asp Glu Leu Lys Ala Lys Ile Asp Gln
180 185 190

Asn Val Glu Glu Leu Lys Gly Arg Leu Thr Pro Tyr Ala Asp Glu Phe
195 200 205

Lys Val Lys Ile Asp Gln Thr Val Glu Glu Leu Arg Arg Ser Leu Ala
210 215 220

Pro Tyr Ala Gln Asp Thr Gln Glu Lys Leu Asn His Gln Leu Glu Gly
225 230 235 240

Leu Thr Phe Gln Met Lys Lys Asn Ala Glu Glu Leu Lys Ala Arg Ile
245 250 255

Ser Ala Ser Ala Glu Glu Leu Arg Gln Arg Leu Ala Pro Leu Ala Glu
260 265 270

Asp Val Arg Gly Asn Leu Arg Gly Asn Thr Glu Gly Leu Gln Lys Ser
275 280 285

Leu Ala Glu Leu Gly Gly His Leu Asp Gln Gln Val Glu Glu Phe Arg
290 295 300

Arg Arg Val Glu Pro Tyr Gly Glu Asn Phe Asn Lys Ala Leu Val Gln
305 310 315 320

Gln Met Glu Gln Leu Arg Thr Lys Leu Gly Pro His Ala Gly Asp Val
325 330 335

Glu Gly His Leu Ser Phe Leu Glu Lys Asp Leu Arg Asp Lys Val Asn
340 345 350

Ser Phe Phe Ser Thr Phe Lys Glu Lys Glu Ser Gln Asp Lys Thr Leu
355 360 365

Ser Leu Pro Glu Leu Glu Gln Gln Gln Glu Gln His Gln Glu Gln Gln
370 375 380

Gln Glu Gln Val Gln Met Leu Ala Pro Leu Glu Ser
385 390 395


<210> 17 
<211> 1156 

<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (61) . . . (1014) 

<223> Nucleotide Sequence encoding apolipoprotein E 
(APOE) 

<400> 17 

cgcagcggag gtgaaggacg tccttcccca ggagccgact ggccaatcac aggcaggaag 60 
atg aag gtt ctg tgg gct gcg ttg ctg gtc aca ttc ctg gca gga tgc 108
Met Lys Val Leu Trp Ala Ala Leu Leu Val Thr Phe Leu Ala Gly Cys
1 5 10 15

cag gcc aag gtg gag caa gcg gtg gag aca gag ccg gag ccc gag ctg 156
Gln Ala Lys Val Glu Gln Ala Val Glu Thr Glu Pro Glu Pro Glu Leu
20 25 30

cgc cag cag acc gag tgg cag agc ggc cag cgc tgg gaa ctg gca ctg 204
Arg Gln Gln Thr Glu Trp Gln Ser Gly Gln Arg Trp Glu Leu Ala Leu
35 40 45

ggt cgc ttt tgg gat tac ctg cgc tgg gtg cag aca ctg tct gag cag 252
Gly Arg Phe Trp Asp Tyr Leu Arg Trp Val Gln Thr Leu Ser Glu Gln
50 55 60

gtg cag gag gag ctg ctc agc tcc cag gtc acc cag gaa ctg agg gcg 300
Val Gln Glu Glu Leu Leu Ser Ser Gln Val Thr Gln Glu Leu Arg Ala
65 70 75 80

ctg atg gac gag acc atg aag gag ttg aag gcc tac aaa tcg gaa ctg 348
Leu Met Asp Glu Thr Met Lys Glu Leu Lys Ala Tyr Lys Ser Glu Leu
85 90 95

gag gaa caa ctg acc ccg gtg gcg gag gag acg cgg gca cgg ctg tcc 396
Glu Glu Gln Leu Thr Pro Val Ala Glu Glu Thr Arg Ala Arg Leu Ser
100 105 110

aag gag ctg cag gcg gcg cag gcc cgg ctg ggc gcg gac atg gag gac 444
Lys Glu Leu Gln Ala Ala Gln Ala Arg Leu Gly Ala Asp Met Glu Asp
115 120 125

gtg tgc ggc cgc ctg gtg cag tac cgc ggc gag gtg cag gcc atg ctc 492
Val Cys Gly Arg Leu Val Gln Tyr Arg Gly Glu Val Gln Ala Met Leu
130 135 140

ggc cag agc acc gag gag ctg cgg gtg cgc ctc gcc tcc cac ctg cgc 540
Gly Gln Ser Thr Glu Glu Leu Arg Val Arg Leu Ala Ser His Leu Arg
145 150 155 160

aag ctg cgt aag cgg ctc ctc cgc gat gcc gat gac ctg cag aag cgc 588
Lys Leu Arg Lys Arg Leu Leu Arg Asp Ala Asp Asp Leu Gln Lys Arg
165 170 175

ctg gca gtg tac cag gcc ggg gcc cgc gag ggc gcc gag cgc ggc ctc 636
Leu Ala Val Tyr Gln Ala Gly Ala Arg Glu Gly Ala Glu Arg Gly Leu
180 185 190

agc gcc atc cgc gag cgc ctg ggg ccc ctg gtg gaa cag ggc cgc gtg 684
Ser Ala Ile Arg Glu Arg Leu Gly Pro Leu Val Glu Gln Gly Arg Val
195 200 205

cgg gcc gcc act gtg ggc tcc ctg gcc ggc cag ccg cta cag gag cgg 732
Arg Ala Ala Thr Val Gly Ser Leu Ala Gly Gln Pro Leu Gln Glu Arg
210 215 220

gcc cag gcc tgg ggc gag cgg ctg cgc gcg cgg atg gag gag atg ggc 780
Ala Gln Ala Trp Gly Glu Arg Leu Arg Ala Arg Met Glu Glu Met Gly
225 230 235 240

agc cgg acc cgc gac cgc ctg gac gag gtg aag gag cag gtg gcg gag 828
Ser Arg Thr Arg Asp Arg Leu Asp Glu Val Lys Glu Gln Val Ala Glu
245 250 255

gtg cgc gcc aag ctg gag gag cag gcc cag cag ata cgc ctg cag gcc 876
Val Arg Ala Lys Leu Glu Glu Gln Ala Gln Gln Ile Arg Leu Gln Ala
260 265 270

gag gcc ttc cag gcc cgc ctc aag agc tgg ttc gag ccc ctg gtg gaa 924
Glu Ala Phe Gln Ala Arg Leu Lys Ser Trp Phe Glu Pro Leu Val Glu
275 280 285

gac atg cag cgc cag tgg gcc ggg ctg gtg gag aag gtg cag gct gcc 972
Asp Met Gln Arg Gln Trp Ala Gly Leu Val Glu Lys Val Gln Ala Ala
290 295 300

gtg ggc acc agc gcc gcc cct gtg ccc agc gac aat cac tga 1014
Val Gly Thr Ser Ala Ala Pro Val Pro Ser Asp Asn His *
305 310 315

acgccgaagc ctgcagccat gcgaccccac gccaccccgt gcctcctgcc tccgcgcagc 1074 

ctgcagcggg agaccctgtc cccgccccag ccgtcctcct ggggtggacc ctagtttaat 1134 

aaagattcac caagtttcac gc 1156 

<210> 18 

<211> 317 

<212> PRT 

<213> Homo sapien 

<400> 18 

Met Lys Val Leu Trp Ala Ala Leu Leu Val Thr Phe Leu Ala Gly Cys
1 5 10 15

Gln Ala Lys Val Glu Gln Ala Val Glu Thr Glu Pro Glu Pro Glu Leu
20 25 30

Arg Gln Gln Thr Glu Trp Gln Ser Gly Gln Arg Trp Glu Leu Ala Leu
35 40 45

Gly Arg Phe Trp Asp Tyr Leu Arg Trp Val Gln Thr Leu Ser Glu Gln
50 55 60

Val Gln Glu Glu Leu Leu Ser Ser Gln Val Thr Gln Glu Leu Arg Ala
65 70 75 80

Leu Met Asp Glu Thr Met Lys Glu Leu Lys Ala Tyr Lys Ser Glu Leu
85 90 95

Glu Glu Gln Leu Thr Pro Val Ala Glu Glu Thr Arg Ala Arg Leu Ser
100 105 110

Lys Glu Leu Gln Ala Ala Gln Ala Arg Leu Gly Ala Asp Met Glu Asp
115 120 125

Val Cys Gly Arg Leu Val Gln Tyr Arg Gly Glu Val Gln Ala Met Leu
130 135 140

Gly Gln Ser Thr Glu Glu Leu Arg Val Arg Leu Ala Ser His Leu Arg
145 150 155 160

Lys Leu Arg Lys Arg Leu Leu Arg Asp Ala Asp Asp Leu Gln Lys Arg
165 170 175

Leu Ala Val Tyr Gln Ala Gly Ala Arg Glu Gly Ala Glu Arg Gly Leu
180 185 190

Ser Ala Ile Arg Glu Arg Leu Gly Pro Leu Val Glu Gln Gly Arg Val
195 200 205

Arg Ala Ala Thr Val Gly Ser Leu Ala Gly Gln Pro Leu Gln Glu Arg
210 215 220

Ala Gln Ala Trp Gly Glu Arg Leu Arg Ala Arg Met Glu Glu Met Gly
225 230 235 240

Ser Arg Thr Arg Asp Arg Leu Asp Glu Val Lys Glu Gln Val Ala Glu
245 250 255

Val Arg Ala Lys Leu Glu Glu Gln Ala Gln Gln Ile Arg Leu Gln Ala
260 265 270

Glu Ala Phe Gln Ala Arg Leu Lys Ser Trp Phe Glu Pro Leu Val Glu
275 280 285

Asp Met Gln Arg Gln Trp Ala Gly Leu Val Glu Lys Val Gln Ala Ala
290 295 300

Val Gly Thr Ser Ala Ala Pro Val Pro Ser Asp Asn His
305 310 315

<210> 19 
<211> 1603 
<212> DNA 
<213> Homo sapien 

<220> 
<221> CDS 

<222> (58) ... (1557) 

<223> Nucleotide sequence encoding hepatic lipase (LIPC) 
<400> 19 

ggtctctttg gcttcagaaa ttaccaagaa agcctggacc ccgggtgaaa cggagaa atg 60
Met
1

gac aca agt ccc ctg tgt ttc tcc att ctg ttg gtt tta tgc atc ttt 108
Asp Thr Ser Pro Leu Cys Phe Ser Ile Leu Leu Val Leu Cys Ile Phe
5 10 15

atc caa tca agt gcc ctt gga caa agc ctg aaa cca gag cca ttt gga 156
Ile Gln Ser Ser Ala Leu Gly Gln Ser Leu Lys Pro Glu Pro Phe Gly
20 25 30

aga aga gct caa gct gtt gaa aca aac aaa acg ctg cat gag atg aag 204
Arg Arg Ala Gln Ala Val Glu Thr Asn Lys Thr Leu His Glu Met Lys
35 40 45

acc aga ttc ctg ctc ttt gga gaa acc aat cag ggc tgt cag att cga 252
Thr Arg Phe Leu Leu Phe Gly Glu Thr Asn Gln Gly Cys Gln Ile Arg
50 55 60 65

atc aat cat ccg gac acg tta cag gag tgc ggc ttc aac tcc tcc ctg 300
Ile Asn His Pro Asp Thr Leu Gln Glu Cys Gly Phe Asn Ser Ser Leu
70 75 80

cct ctg gtg atg ata atc cac ggg tgg tcg gtg gac ggc gtg cta gaa 348
Pro Leu Val Met Ile Ile His Gly Trp Ser Val Asp Gly Val Leu Glu
85 90 95

aac tgg atc tgg cag atg gtg gcc gcg ctg aag tct cag ccg gcc cag 396
Asn Trp Ile Trp Gln Met Val Ala Ala Leu Lys Ser Gln Pro Ala Gln
100 105 110

cca gtg aac gtg ggg ctg gtg gac tgg atc acc ctg gcc cac gac cac 444
Pro Val Asn Val Gly Leu Val Asp Trp Ile Thr Leu Ala His Asp His
115 120 125

tac acc atc gcc gtc cgc aac acc cgc ctt gtg ggc aag gag gtc gcg 492
Tyr Thr Ile Ala Val Arg Asn Thr Arg Leu Val Gly Lys Glu Val Ala
130 135 140 145

gct ctt ctc cgg tgg ctg gag gaa tct gtt caa ctc tct cga agc cat 540
Ala Leu Leu Arg Trp Leu Glu Glu Ser Val Gln Leu Ser Arg Ser His
150 155 160

gtt cac cta att ggg tac agc ctg ggt gca cac gtg tca gga ttt gcc 588
Val His Leu Ile Gly Tyr Ser Leu Gly Ala His Val Ser Gly Phe Ala
165 170 175

ggc agt tcc atc ggt gga acg cac aag att ggg aga atc aca ggg ctg 636
Gly Ser Ser Ile Gly Gly Thr His Lys Ile Gly Arg Ile Thr Gly Leu
180 185 190

gat gcc gcg gga cct ttg ttt gag gga agt gcc ccc agc aat cgt ctt 684
Asp Ala Ala Gly Pro Leu Phe Glu Gly Ser Ala Pro Ser Asn Arg Leu
195 200 205

tct cca gat gat gcc aat ttt gtg gat gcc att cat acc ttt acg cgg 732
Ser Pro Asp Asp Ala Asn Phe Val Asp Ala Ile His Thr Phe Thr Arg
210 215 220 225

gag cac atg ggc ctg agc gtg ggc atc aaa cag ccc ata gga cac tat 780
Glu His Met Gly Leu Ser Val Gly Ile Lys Gln Pro Ile Gly His Tyr
230 235 240

gac ttc tat ccc aac ggg ggc tcc ttc cag cct ggc tgc cac ttc cta 828
Asp Phe Tyr Pro Asn Gly Gly Ser Phe Gln Pro Gly Cys His Phe Leu
245 250 255

gag ctc tac aga cat att gcc cag cac ggc ttc aat gcc atc acc cag 876
Glu Leu Tyr Arg His Ile Ala Gln His Gly Phe Asn Ala Ile Thr Gln
260 265 270

acc ata aaa tgc tcc cac gag cga tcg gtg cac ctt ttc atc gac tcc 924
Thr Ile Lys Cys Ser His Glu Arg Ser Val His Leu Phe Ile Asp Ser
275 280 285

ttg ctg cac gcc ggc acg cag agc atg gcc tac ccg tgt ggt gac atg 972
Leu Leu His Ala Gly Thr Gln Ser Met Ala Tyr Pro Cys Gly Asp Met
290 295 300 305

aac agc ttc agc cag ggc ctg tgc ctg agc tgc aag aag ggc cgc tgc 1020
Asn Ser Phe Ser Gln Gly Leu Cys Leu Ser Cys Lys Lys Gly Arg Cys
310 315 320

aac acg ctg ggc tac cac gtc cgc cag gag ccg cgg agc aag agc aag 1068
Asn Thr Leu Gly Tyr His Val Arg Gln Glu Pro Arg Ser Lys Ser Lys
325 330 335

agg ctc ttc ctc gta acg cga gcc cag tcc ccc ttc aaa gtt tat cat 1116
Arg Leu Phe Leu Val Thr Arg Ala Gln Ser Pro Phe Lys Val Tyr His
340 345 350

tac cag tta aag atc cag ttc atc aac caa act gag acg cca ata caa 1164
Tyr Gln Leu Lys Ile Gln Phe Ile Asn Gln Thr Glu Thr Pro Ile Gln
355 360 365

aca act ttt acc atg tca cta ctc gga aca aaa gag aaa atg cag aaa 1212
Thr Thr Phe Thr Met Ser Leu Leu Gly Thr Lys Glu Lys Met Gln Lys
370 375 380 385

att ccc atc act ctg ggc aaa gga att gct agt aat aaa acg tat tcc 1260
Ile Pro Ile Thr Leu Gly Lys Gly Ile Ala Ser Asn Lys Thr Tyr Ser
390 395 400

ttt ctt atc acg ctg gat gtg gat atc ggc gag ctg atc atg atc aag 1308
Phe Leu Ile Thr Leu Asp Val Asp Ile Gly Glu Leu Ile Met Ile Lys
405 410 415

ttc aag tgg gaa aac agt gca gtg tgg gcc aat gtc tgg gac acg gtc 1356
Phe Lys Trp Glu Asn Ser Ala Val Trp Ala Asn Val Trp Asp Thr Val
420 425 430

cag acc atc atc cca tgg agc aca ggg ccg cgc cac tca ggc ctc gtt 1404
Gln Thr Ile Ile Pro Trp Ser Thr Gly Pro Arg His Ser Gly Leu Val
435 440 445

ctg aag acg atc aga gtc aaa gca gga gaa acc cag caa aga atg aca 1452
Leu Lys Thr Ile Arg Val Lys Ala Gly Glu Thr Gln Gln Arg Met Thr
450 455 460 465

ttt tgt tca gaa aac aca gat gac cta cta ctt cgc cca acc cag gaa 1500
Phe Cys Ser Glu Asn Thr Asp Asp Leu Leu Leu Arg Pro Thr Gln Glu
470 475 480

aaa atc ttc gtg aaa tgt gaa ata aag tct aaa aca tca aag cga aag 1548
Lys Ile Phe Val Lys Cys Glu Ile Lys Ser Lys Thr Ser Lys Arg Lys
485 490 495

atc aga tga gatttaatga agacccagtg taaagaataa atgaatctta 1597
Ile Arg *

ctcctt 1603


<210> 20 
<211> 499 
<212> PRT 

<213> Homo sapien 

<400> 20 

Met Asp Thr Ser Pro Leu Cys Phe Ser Ile Leu Leu Val Leu Cys Ile
1 5 10 15

Phe Ile Gln Ser Ser Ala Leu Gly Gln Ser Leu Lys Pro Glu Pro Phe
20 25 30

Gly Arg Arg Ala Gln Ala Val Glu Thr Asn Lys Thr Leu His Glu Met
35 40 45

Lys Thr Arg Phe Leu Leu Phe Gly Glu Thr Asn Gln Gly Cys Gln Ile
50 55 60

Arg Ile Asn His Pro Asp Thr Leu Gln Glu Cys Gly Phe Asn Ser Ser
65 70 75 80

Leu Pro Leu Val Met Ile Ile His Gly Trp Ser Val Asp Gly Val Leu
85 90 95

Glu Asn Trp Ile Trp Gln Met Val Ala Ala Leu Lys Ser Gln Pro Ala
100 105 110

Gln Pro Val Asn Val Gly Leu Val Asp Trp Ile Thr Leu Ala His Asp
115 120 125

His Tyr Thr Ile Ala Val Arg Asn Thr Arg Leu Val Gly Lys Glu Val
130 135 140

Ala Ala Leu Leu Arg Trp Leu Glu Glu Ser Val Gln Leu Ser Arg Ser
145 150 155 160

His Val His Leu Ile Gly Tyr Ser Leu Gly Ala His Val Ser Gly Phe
165 170 175

Ala Gly Ser Ser Ile Gly Gly Thr His Lys Ile Gly Arg Ile Thr Gly
180 185 190

Leu Asp Ala Ala Gly Pro Leu Phe Glu Gly Ser Ala Pro Ser Asn Arg
195 200 205

Leu Ser Pro Asp Asp Ala Asn Phe Val Asp Ala Ile His Thr Phe Thr
210 215 220

Arg Glu His Met Gly Leu Ser Val Gly Ile Lys Gln Pro Ile Gly His
225 230 235 240

Tyr Asp Phe Tyr Pro Asn Gly Gly Ser Phe Gln Pro Gly Cys His Phe
245 250 255

Leu Glu Leu Tyr Arg His Ile Ala Gln His Gly Phe Asn Ala Ile Thr
260 265 270

Gln Thr Ile Lys Cys Ser His Glu Arg Ser Val His Leu Phe Ile Asp
275 280 285

Ser Leu Leu His Ala Gly Thr Gln Ser Met Ala Tyr Pro Cys Gly Asp
290 295 300

Met Asn Ser Phe Ser Gln Gly Leu Cys Leu Ser Cys Lys Lys Gly Arg
305 310 315 320

Cys Asn Thr Leu Gly Tyr His Val Arg Gln Glu Pro Arg Ser Lys Ser
325 330 335

Lys Arg Leu Phe Leu Val Thr Arg Ala Gln Ser Pro Phe Lys Val Tyr
340 345 350

His Tyr Gln Leu Lys Ile Gln Phe Ile Asn Gln Thr Glu Thr Pro Ile
355 360 365

Gln Thr Thr Phe Thr Met Ser Leu Leu Gly Thr Lys Glu Lys Met Gln
370 375 380

Lys Ile Pro Ile Thr Leu Gly Lys Gly Ile Ala Ser Asn Lys Thr Tyr
385 390 395 400

Ser Phe Leu Ile Thr Leu Asp Val Asp Ile Gly Glu Leu Ile Met Ile
405 410 415

Lys Phe Lys Trp Glu Asn Ser Ala Val Trp Ala Asn Val Trp Asp Thr
420 425 430

Val Gln Thr Ile Ile Pro Trp Ser Thr Gly Pro Arg His Ser Gly Leu
435 440 445

Val Leu Lys Thr Ile Arg Val Lys Ala Gly Glu Thr Gln Gln Arg Met
450 455 460

Thr Phe Cys Ser Glu Asn Thr Asp Asp Leu Leu Leu Arg Pro Thr Gln
465 470 475 480

Glu Lys Ile Phe Val Lys Cys Glu Ile Lys Ser Lys Thr Ser Lys Arg
485 490 495

Lys Ile Arg

<210> 21 

<211> 1346 

<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (10) . . . (1077) 

<223> Nucleotide sequence encoding paraoxonase 1 (PON1) 
<400> 21 

cccccgacc atg gcg aag ctg att gcg ctc acc ctc ttg ggg atg gga ctg 51
Met Ala Lys Leu Ile Ala Leu Thr Leu Leu Gly Met Gly Leu
1 5 10

gca ctc ttc agg aac cac cag tct tct tac caa aca cga ctt aat gct 99
Ala Leu Phe Arg Asn His Gln Ser Ser Tyr Gln Thr Arg Leu Asn Ala
15 20 25 30

ctc cga gag gta caa ccc gta gaa ctt cct aac tgt aat tta gtt aaa 147
Leu Arg Glu Val Gln Pro Val Glu Leu Pro Asn Cys Asn Leu Val Lys
35 40 45

gga atc gaa act ggc tct gaa gac atg gag ata ctg cct aat gga ctg 195
Gly Ile Glu Thr Gly Ser Glu Asp Met Glu Ile Leu Pro Asn Gly Leu
50 55 60

gct ttc att agc tct gga tta aag tat cct gga ata aag agc ttc aac 243
Ala Phe Ile Ser Ser Gly Leu Lys Tyr Pro Gly Ile Lys Ser Phe Asn
65 70 75

ccc aac agt cct gga aaa ata ctt ctg atg gac ctg aat gaa gaa gat 291
Pro Asn Ser Pro Gly Lys Ile Leu Leu Met Asp Leu Asn Glu Glu Asp
80 85 90

cca aca gtg ttg gaa ttg ggg atc act gga agt aaa ttt gat gta tct 339
Pro Thr Val Leu Glu Leu Gly Ile Thr Gly Ser Lys Phe Asp Val Ser
95 100 105 110

tca ttt aac cct cat ggg att agc aca ttc aca gat gaa gat aat gcc 387
Ser Phe Asn Pro His Gly Ile Ser Thr Phe Thr Asp Glu Asp Asn Ala
115 120 125

atg tac ctc ctg gtg gtg aac cat cca gat gcc aag tcc aca gtg gag 435
Met Tyr Leu Leu Val Val Asn His Pro Asp Ala Lys Ser Thr Val Glu
130 135 140

ttg ttt aaa ttt caa gaa gaa gaa aaa tcg ctt ttg cat cta aaa acc 483
Leu Phe Lys Phe Gln Glu Glu Glu Lys Ser Leu Leu His Leu Lys Thr
145 150 155

atc aga cat aaa ctt ctg cct aat ttg aat gat att gtt gct gtg gga 531
Ile Arg His Lys Leu Leu Pro Asn Leu Asn Asp Ile Val Ala Val Gly
160 165 170

cct gag cac ttt tat ggc aca aat gat cac tat ttt ctt gac ccc tac 579
Pro Glu His Phe Tyr Gly Thr Asn Asp His Tyr Phe Leu Asp Pro Tyr
175 180 185 190

tta caa tcc tgg gag atg tat ttg ggt tta gcg tgg tcg tat gtt gtc 627
Leu Gln Ser Trp Glu Met Tyr Leu Gly Leu Ala Trp Ser Tyr Val Val
195 200 205

tac tat agt cca agt gaa gtt cga gtg gtg gca gaa gga ttt gat ttt 675
Tyr Tyr Ser Pro Ser Glu Val Arg Val Val Ala Glu Gly Phe Asp Phe
210 215 220

gct aat gga atc aac att tca ccc gat ggc aag tat gtc tat ata gct 723
Ala Asn Gly Ile Asn Ile Ser Pro Asp Gly Lys Tyr Val Tyr Ile Ala
225 230 235

gag ttg ctg gct cat aag att cat gtg tat gaa aag cat gct aat tgg 771
Glu Leu Leu Ala His Lys Ile His Val Tyr Glu Lys His Ala Asn Trp
240 245 250

act tta act cca ttg aag tcc ctt gac ttt aat acc ctc gtg gat aac 819
Thr Leu Thr Pro Leu Lys Ser Leu Asp Phe Asn Thr Leu Val Asp Asn
255 260 265 270

ata tct gtg gat cct gag aca gga gac ctt tgg gtt gga tgc cat ccc 867
Ile Ser Val Asp Pro Glu Thr Gly Asp Leu Trp Val Gly Cys His Pro
275 280 285

aat ggc atg aaa atc ttc ttc tat gac tca gag aat cct cct gca tca 915
Asn Gly Met Lys Ile Phe Phe Tyr Asp Ser Glu Asn Pro Pro Ala Ser
290 295 300

gag gtg ctt cga atc cag aac att cta aca gaa gaa cct aaa gtg aca 963
Glu Val Leu Arg Ile Gln Asn Ile Leu Thr Glu Glu Pro Lys Val Thr
305 310 315

cag gtt tat gca gaa aat ggc aca gtg ttg caa ggc agt aca gtt gcc 1011
Gln Val Tyr Ala Glu Asn Gly Thr Val Leu Gln Gly Ser Thr Val Ala
320 325 330

tct gtg tac aaa ggg aaa ctg ctg att ggc aca gtg ttt cac aaa gct 1059
Ser Val Tyr Lys Gly Lys Leu Leu Ile Gly Thr Val Phe His Lys Ala
335 340 345 350

ctt tac tgt gag ctc taa cagaccgatt tgcacccatg ccatagaaac 1107
Leu Tyr Cys Glu Leu *
355

tgaggccatt atttcaaccg cttgccatat tccgaggacc cagtgttctt agctgaacaa 1167
tgaatgctga ccctaaatgt ggacatcatg aagcatcaaa gcactgttta actgggagtg 1227




atatgatgtg tagggctttt ttttgagaat acactatcaa atcagtcttg gaatacttga 1287 
aaacctcatt taccataaaa atccttctca ctaaaatgga taaatcagtt aaaaaaaaa 1346 

<210> 22 

<211> 355 

<212> PRT 

<213> Homo sapien 

<400> 22 

Met Ala Lys Leu Ile Ala Leu Thr Leu Leu Gly Met Gly Leu Ala Leu
1 5 10 15

Phe Arg Asn His Gln Ser Ser Tyr Gln Thr Arg Leu Asn Ala Leu Arg
20 25 30

Glu Val Gln Pro Val Glu Leu Pro Asn Cys Asn Leu Val Lys Gly Ile
35 40 45

Glu Thr Gly Ser Glu Asp Met Glu Ile Leu Pro Asn Gly Leu Ala Phe
50 55 60

Ile Ser Ser Gly Leu Lys Tyr Pro Gly Ile Lys Ser Phe Asn Pro Asn
65 70 75 80

Ser Pro Gly Lys Ile Leu Leu Met Asp Leu Asn Glu Glu Asp Pro Thr
85 90 95

Val Leu Glu Leu Gly Ile Thr Gly Ser Lys Phe Asp Val Ser Ser Phe
100 105 110

Asn Pro His Gly Ile Ser Thr Phe Thr Asp Glu Asp Asn Ala Met Tyr
115 120 125

Leu Leu Val Val Asn His Pro Asp Ala Lys Ser Thr Val Glu Leu Phe
130 135 140

Lys Phe Gln Glu Glu Glu Lys Ser Leu Leu His Leu Lys Thr Ile Arg
145 150 155 160

His Lys Leu Leu Pro Asn Leu Asn Asp Ile Val Ala Val Gly Pro Glu
165 170 175

His Phe Tyr Gly Thr Asn Asp His Tyr Phe Leu Asp Pro Tyr Leu Gln
180 185 190

Ser Trp Glu Met Tyr Leu Gly Leu Ala Trp Ser Tyr Val Val Tyr Tyr
195 200 205

Ser Pro Ser Glu Val Arg Val Val Ala Glu Gly Phe Asp Phe Ala Asn
210 215 220

Gly Ile Asn Ile Ser Pro Asp Gly Lys Tyr Val Tyr Ile Ala Glu Leu
225 230 235 240

Leu Ala His Lys Ile His Val Tyr Glu Lys His Ala Asn Trp Thr Leu
245 250 255

Thr Pro Leu Lys Ser Leu Asp Phe Asn Thr Leu Val Asp Asn Ile Ser
260 265 270

Val Asp Pro Glu Thr Gly Asp Leu Trp Val Gly Cys His Pro Asn Gly
275 280 285

Met Lys Ile Phe Phe Tyr Asp Ser Glu Asn Pro Pro Ala Ser Glu Val
290 295 300

Leu Arg Ile Gln Asn Ile Leu Thr Glu Glu Pro Lys Val Thr Gln Val
305 310 315 320

Tyr Ala Glu Asn Gly Thr Val Leu Gln Gly Ser Thr Val Ala Ser Val
325 330 335

Tyr Lys Gly Lys Leu Leu Ile Gly Thr Val Phe His Lys Ala Leu Tyr
340 345 350

Cys Glu Leu
355


<210> 23 
<211> 1570 
<212> DNA 

<213> Homo sapien 

<220> 

<221> CDS 

<222> (1) . . . (1097) 

<223> Nucleotide sequence encoding paraoxonase 2 (PON2) 
<400> 23 

cgg agc gag gca gcg cgc ccg gct ccc gcg cca tgg ggc ggc tgg tgg 48
Arg Ser Glu Ala Ala Arg Pro Ala Pro Ala Pro Trp Gly Gly Trp Trp
1 5 10 15

ctg tgg gct tgc tgg gga tcg cgc tgg cgc tcc tgg gcg aga ggc ttc 96
Leu Trp Ala Cys Trp Gly Ser Arg Trp Arg Ser Trp Ala Arg Gly Phe
20 25 30

tgg cac tca gaa atc gac tta aag cct cca gag aag tag aat ctg tag 144
Trp His Ser Glu Ile Asp Leu Lys Pro Pro Glu Lys * Asn Leu *
35 40 45

acc ttc cac act gcc acc tga tta aag gaa ttg aag ctg gct ctg aag 192
Thr Phe His Thr Ala Thr * Leu Lys Glu Leu Lys Leu Ala Leu Lys
50 55 60

ata ttg aca tac ttc cca atg gtc tgg ctt ttt tta gtg tgg gtc taa 240
Ile Leu Thr Tyr Phe Pro Met Val Trp Leu Phe Leu Val Trp Val *
65 70 75

aat tcc cag gac tcc aca gct ttg cac cag ata agc ctg gag gaa tac 288
Asn Ser Gln Asp Ser Thr Ala Leu His Gln Ile Ser Leu Glu Glu Tyr
80 85 90

taa tga tgg atc taa aag aag aaa aac caa ggg cac ggg aat taa gaa 336
* * Trp Ile * Lys Lys Lys Asn Gln Gly His Gly Asn * Glu
95 100

tca gtc gtg ggt ttg att tgg cct cat tca atc cac atg gca tca gca 384
Ser Val Val Gly Leu Ile Trp Pro His Ser Ile His Met Ala Ser Ala
105 110 115 120

ctt tca tag aca acg atg aca cag ttt atc tct ttg ttg taa acc acc 432
Leu Ser * Thr Thr Met Thr Gln Phe Ile Ser Leu Leu * Thr Thr
125 130

cag aat tca aga ata cag tgg aaa ttt tta aat ttg aag aag cag aaa 480
Gln Asn Ser Arg Ile Gln Trp Lys Phe Leu Asn Leu Lys Lys Gln Lys
135 140 145 150

att ctc tgt tgc atc tga aaa cag tca aac atg agc ttc ttc caa gtg 528
Ile Leu Cys Cys Ile * Lys Gln Ser Asn Met Ser Phe Phe Gln Val
155 160 165

tga atg aca tca cag ctg ttg gac cgg cac att tct atg cca caa atg 576
* Met Thr Ser Gln Leu Leu Asp Arg His Ile Ser Met Pro Gln Met
170 175 180

acc act act tct ctg atc ctt tct taa agt att tag aaa cat act tga 624
Thr Thr Thr Ser Leu Ile Leu Ser * Ser Ile * Lys His Thr *
185 190

act tac act ggg caa atg ttg ttt act aca gtc caa atg aag tta aag 672
Thr Tyr Thr Gly Gln Met Leu Phe Thr Thr Val Gln Met Lys Leu Lys
195 200 205

tgg tag cag aag gat ttg att cag caa atg gga tca ata ttt cac ctg 720
Trp * Gln Lys Asp Leu Ile Gln Gln Met Gly Ser Ile Phe His Leu
210 215 220

atg ata agt ata tct atg ttg ctg aca tat tgg ctc atg aaa ttc atg 768
Met Ile Ser Ile Ser Met Leu Leu Thr Tyr Trp Leu Met Lys Phe Met
225 230 235 240

ttt tgg aaa aac aca cta ata tga att taa ctc agt tga agg tac ttg 816
Phe Trp Lys Asn Thr Leu Ile * Ile * Leu Ser * Arg Tyr Leu
245 250

agc tgg ata cac tgg tgg ata att tat cta ttg atc ctt cct cgg ggg 864
Ser Trp Ile His Trp Trp Ile Ile Tyr Leu Leu Ile Leu Pro Arg Gly
255 260 265

aca tct ggg tag gct gtc atc cta atg gcc aga agc tct tcg tgt atg 912
Thr Ser Gly * Ala Val Ile Leu Met Ala Arg Ser Ser Ser Cys Met
270 275 280

acc cga aca atc ctc cct cgt cag agg ttc tcc gca tcc aga aca ttc 960
Thr Arg Thr Ile Leu Pro Arg Gln Arg Phe Ser Ala Ser Arg Thr Phe
285 290 295 300

tat ctg aga agc cta cag tga cta cag ttt atg cca aca atg ggt ctg 1008
Tyr Leu Arg Ser Leu Gln * Leu Gln Phe Met Pro Thr Met Gly Leu
305 310 315

ttc tcc aag gaa gtt ctg tag cct cag tgt atg atg gga agc tgc tca 1056
Phe Ser Lys Glu Val Leu * Pro Gln Cys Met Met Gly Ser Cys Ser
320 325 330

tag gca ctt tat acc aca gag cct tgt att gtg aac tct aa attgtacttt 1107
* Ala Leu Tyr Thr Thr Glu Pro Cys Ile Val Asn Ser
335 340

tggcatgaaa gtgcgataac ttaacaatta attttctatg aattgctaat tctgagggaa 1167

tttaaccagc aacattgacc cagaaatgta tggcatgtgt agttaatttt attccagtaa 1227

ggaacggccc ttttagttct tagagcactt ttaacaaaaa aggaaaatga acaggttctt 1287

taaaatgcca agcaagggac agaaaagaaa gctgctttcg aataaagtga atacattttg 1347

cacaaagtaa gcctcacctt tgccttccaa ctgccagaac atggattcca ctgaaataga 1407 

gtgaattata tttccttaaa atgtgagtga cctcacttct ggcactgtga ctactatggc 1467 

tgtttagaac tactgataac gtattttgat gttttgtact tacatctttg tttaccatta 1527 

aaaagttgga gttatattaa agactaacta aaatcccagt ttt 1570 

<210> 24 

<211> 342 

<212> PRT 

<213> Homo sapien 

<400> 24 




Arg Ser Glu Ala Ala Arg 

Pro Ala Pro 

Ala 

Pro 

Trp Gly Gly Trp Trp 

1 5 



10 



15 


Leu Trp Ala Cys Trp Gly 

Ser Arg Trp Arg 

Ser 

Trp Ala Arg Gly Phe 

20 


25 



30 



Trp His Ser Glu lie Asp 

Leu Lys Pro 

Pro 

Glu 

Lys Asn Leu 

Thr 

Phe 

35 


40 



45 



His Thr Ala Thr Leu 

Lys 

Glu Leu Lys 

Leu 

Ala 

Leu Lys He 

Leu 

Thr 

50 

55 



60 



Tyr Phe Pro Met Val 

Trp 

Leu Phe Leu 

Val 

Trp 

Val Asn Ser Gin Asp 

65 

70 



75 



60 

Ser Thr Ala Leu His 

Gin 

He Ser Leu 

Glu 

Glu 

Tyr Trp He 

Lys 

Lys 

85 



90 



95 


Lys Asn Gin Gly His 

Gly 

Asn Glu Ser 

Val 

Val 

Gly Leu He Trp 

Pro 

100 


105 



110 



His Ser lie His Met 

Ala 

Ser Ala Leu 

Ser 

Thr 

Thr Met Thr 

Gin 

Phe 

115 


120 



125 



lie Ser Leu Leu Thr 

Thr 

Gin Asn Ser Arg 

He 

Gin Trp Lys 

Phe 

Leu 

130 


135 



140 



Asn Leu Lys Lys Gin Lys 

He Leu Cys 

Cys 

He 

Lys Gin Ser 

Asn 

Met 

145 

150 



155 

Asp Arg His 

He 

160 

Ser Phe Phe Gin Val 

Met 

Thr Ser Gin 

Leu 

Leu 

Ser 

165 



170 



175 


Met Pro Gin Met Thr 

Thr 

Thr Ser Leu 

He 

Leu 

Ser Ser He 

Lys 

His 

180 


185 



190 



Thr Thr Tyr Thr Gly Gin 

Met Leu Phe 

Thr 

Thr 

Val Gin Met 

Lys 

Leu 

195 


200 



205 

Lys Trp Gln Lys Asp Leu Ile Gln Gln Met Gly Ser Ile Phe His Leu 
210 215 220 



Met Ile Ser Ile Ser Met Leu Leu Thr Tyr Trp Leu Met Lys Phe Met 
225 230 235 240 

Phe Trp Lys Asn Thr Leu Ile Ile Leu Ser Arg Tyr Leu Ser Trp Ile 
245 250 255 

His Trp Trp Ile Ile Tyr Leu Leu Ile Leu Pro Arg Gly Thr Ser Gly 
260 265 270 

Ala Val Ile Leu Met Ala Arg Ser Ser Ser Cys Met Thr Arg Thr Ile 
275 280 285 

Leu Pro Arg Gln Arg Phe Ser Ala Ser Arg Thr Phe Tyr Leu Arg Ser 
290 295 300 

Leu Gln Leu Gln Phe Met Pro Thr Met Gly Leu Phe Ser Lys Glu Val 
305 310 315 320 

Leu Pro Gln Cys Met Met Gly Ser Cys Ser Ala Leu Tyr Thr Thr Glu 
325 330 335 

Pro Cys Ile Val Asn Ser 
340 

<210> 25 

<211> 533 

<212> DNA 

<213> Homo sapien 

<220> 

<221> CDS 

<222> (47)...(346) 

<223> Nucleotide sequence encoding apolipoprotein C-III (APOC3) 

<400> 25 

tgctcagttc atccctagag gcagctgctc caggaacaga ggtgcc atg cag ccc 55 
Met Gln Pro 
1 

cgg gta ctc ctt gtt gtt gcc ctc ctg gcg ctc ctg gcc tct gcc cga 103 
Arg Val Leu Leu Val Val Ala Leu Leu Ala Leu Leu Ala Ser Ala Arg 
5 10 15 

gct tca gag gcc gag gat gcc tcc ctt ctc agc ttc atg cag ggt tac 151 
Ala Ser Glu Ala Glu Asp Ala Ser Leu Leu Ser Phe Met Gln Gly Tyr 
20 25 30 35 

atg aag cac gcc acc aag acc gcc aag gat gca ctg agc agc gtg cag 199 
Met Lys His Ala Thr Lys Thr Ala Lys Asp Ala Leu Ser Ser Val Gln 
40 45 50 

gag tcc cag gtg gcc cag cag gcc agg ggc tgg gtg acc gat ggc ttc 247 
Glu Ser Gln Val Ala Gln Gln Ala Arg Gly Trp Val Thr Asp Gly Phe 
55 60 65 

agt tcc ctg aaa gac tac tgg agc acc gtt aag gac aag ttc tct gag 295 
Ser Ser Leu Lys Asp Tyr Trp Ser Thr Val Lys Asp Lys Phe Ser Glu 
70 75 80 

ttc tgg gat ttg gac cct gag gtc aga cca act tca gcc gtg gct gcc 343 
Phe Trp Asp Leu Asp Pro Glu Val Arg Pro Thr Ser Ala Val Ala Ala 
85 90 95 

tga gacctcaata ccccaagtcc acctgcctat ccatcctgcg agctccttgg 396 

gtcctgcaat ctccagggct gcccctgtag gttgcttaaa agggacagta ttctcagtgc 456 
tctcctaccc cacctcatgc ctggcccccc tccaggcatg ctggcctccc aataaagctg 516 
gacaagaagc tgctatg 533 

<210> 26 

<211> 99 

<212> PRT 

<213> Homo sapien 

<400> 26 

Met Gln Pro Arg Val Leu Leu Val Val Ala Leu Leu Ala Leu Leu Ala 
1 5 10 15 

Ser Ala Arg Ala Ser Glu Ala Glu Asp Ala Ser Leu Leu Ser Phe Met 
20 25 30 

Gln Gly Tyr Met Lys His Ala Thr Lys Thr Ala Lys Asp Ala Leu Ser 
35 40 45 

Ser Val Gln Glu Ser Gln Val Ala Gln Gln Ala Arg Gly Trp Val Thr 
50 55 60 

Asp Gly Phe Ser Ser Leu Lys Asp Tyr Trp Ser Thr Val Lys Asp Lys 
65 70 75 80 

Phe Ser Glu Phe Trp Asp Leu Asp Pro Glu Val Arg Pro Thr Ser Ala 
85 90 95 

Val Ala Ala 


<210> 27 
<211> 8925 
<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (5020)...(6162) 

<223> Nucleotide encoding ATP-binding cassette (ABC1) 
<223> n = a or g or c or t 
<400> 27 

ctcagtgtca gctgctgctg gaagtggcct ggcctctatt tatcttcctg atcctgatct 60 

ctgttcggct gagctaccca ccctatgaac aacatgaatg ccattttcca aataaagcca 120 

tgccctctgc aggaacactt ccttgggttc aggggattat ctgtaatgcc aacaacccct 180 

gtttccgtta cccgactcct ggggaggctc ccggagttgt tggaaacttt aacaaatcca 240 

ttgtggctcg cctgttctca gatgctcgga ggcttctttt atacagccag aaagacacca 300 

gcatgaagga catgcgcaaa gttctgagaa cattacagca gatcaagaaa tccagctcaa 360 

acttgaagct tcaagatttc ctggtggaca atgaaacctt ctctgggttc ctgtatcaca 420 

acctctctct cccaaagtct actgtggaca agatgctgag ggctgatgtc attctccaca 480 

aggtattttt gcaaggctac cagttacatt tgacaagtct gtgcaatgga tcaaaatcag 540 

aagagatgat tcaacttggt gaccaagaag tttctgagct ttgtggccta ccaagggaga 600 

aactggctgc agcagagcga gtacttcgtt ccaacatgga catcctgaag ccaatcctga 660 

gaacactaaa ctctacatct cccttcccga gcaaggagct ggctgaagcc acaaaaacat 720 

tgctgcatag tcttgggact ctggcccagg agctgttcag catgagaagc tggagtgaca 780 

tgcgacagga ggtgatgttt ctgaccaatg tgaacagctc cagctcctcc acccaaatct 840 

accaggctgt gtctcgtatt gtctgcgggc atcccgaggg aggggggctg aagatcaagt 900 

ctctcaactg gtatgaggac aacaactaca aagccctctt tggaggcaat ggcactgagg 960 

aagatgctga aaccttctat gacaactcta caactcctta ctgcaatgat ttgatgaaga 1020 

atttggagtc tagtcctctt tcccgcatta tctggaaagc tctgaagccg ctgctcgttg 1080 

ggaagatcct gtatacacct gacactccag ccacaaggca ggtcatggct gaggtgaaca 1140 

agaccttcca ggaactggct gtgttccatg atctggaagg catgtgggag gaactcagcc 1200 

ccaagatctg gaccttcatg gagaacagcc aagaaatgga ccttgtccgg atgctgttgg 1260 

acagcaggga caatgaccac ttttgggaac agcagttgga tggcttagat tggacagccc 1320 

aagacatcgt ggcgtttttg gccaagcacc cagaggatgt ccagtccagt aatggttctg 1380 

tgtacacctg gagagaagct ttcaacgaga ctaaccaggc aatccggacc atatctcgct 1440 

tcatggagtg tgtcaacctg aacaagctag aacccatagc aacagaagtc tggctcatca 1500 

acaagtccat ggagctgctg gatgagagga agttctgggc tggtattgtg ttcactggaa 1560 

ttactccagg cagcattgag ctgccccatc atgtcaagta caagatccga atggacattg 1620 

acaatgtgga gaggacaaat aaaatcaagg atgggtactg ggaccctggt cctcgagctg 1680 

acccctttga ggacatgcgg tacgtctggg ggggcttcgc ctacttgcag gatgtggtgg 1740 

agcaggcaat catcagggtg ctgacgggca ccgagaagaa aactggtgtc tatatgcaac 1800 

agatgcccta tccctgttac gttgatgaca tctttctgcg ggtgatgagc cggtcaatgc 1860 

ccctcttcat gacgctggcc tggatttact cagtggctgt gatcatcaag ggcatcgtgt 1920 

atgagaagga ggcacggctg aaagagacca tgcggatcat gggcctggac aacagcatcc 1980 

tctggtttag ctggttcatt agtagcctca ttcctcttct tgtgagcgct ggcctgctag 2040 

tggtcatcct gaagttagga aacctgctgc cctacagtga tcccagcgtg gtgtttgtct 2100 

tcctgtccgt gtttgctgtg gtgacaatcc tgcagtgctt cctgattagc acactcttct 2160 

ccagagccaa cctggcagca gcctgtgggg gcatcatcta cttcacgctg tacctgccct 2220 

acgtcctgtg tgtggcatgg caggactacg tgggcttcac actcaagatc ttcgctagcc 2280 

tgctgtctcc tgtggctttt gggtttggct gtgagtactt tgcccttttt gaggagcagg 2340 

gcattggagt gcagtgggac aacctgtttg agagtcctgt ggaggaagat ggcttcaatc 2400 

tcaccacttc ggtctccatg atgctgtttg acaccttcct ctatggggtg atgacctggt 2460 

acattgaggc tgtctttcca ggccagtacg gaattcccag gccctggtat tttccttgca 2520 

ccaagtccta ctggtttggc gaggaaagtg atgagaagag ccaccctggt tccaaccaga 2580 

agagaatatc agaaatctgc atggaggagg aacccaccca cttgaagctg ggcgtgtcca 2640 

ttcagaacct ggtaaaagtc taccgagatg ggatgaaggt ggctgtcgat ggcctggcac 2700 

tgaattttta tgagggccag atcacctcct tcctgggcca caatggagcg gggaagacga 2760 

ccaccatgtc aatcctgacc gggttgttcc ccccgacctc gggcaccgcc tacatcctgg 2820 
gaaaagacat tcgctctgag atgagcacca tccggcagaa cctgggggtc tgtccccagc 2880 

ataacgtgct gtttgacatg ctgactgtcg aagaacacat ctggttctat gcccgcttga 2940 

aagggctctc tgagaagcac gtgaaggcgg agatggagca gatggccctg gatgttggtt 3000 

tgccatcaag caagctgaaa agcaaaacaa gccagctgtc aggtggaatg cagagaaagc 3060 

tatctgtggc cttggccttt gtcgggggat ctaaggttgt cattctggat gaacccacag 3120 

ctggtgtgga cccttactcc cgcaggggaa tatgggagct gctgctgaaa taccgacaag 3180 

gccgcaccat tattctctct acacaccaca tggatgaagc ggacgtcctg ggggacagga 3240 

ttgccatcat ctcccatggg aagctgtgct gtgtgggctc ctccctgttt ctgaagaacc 3300 

agctgggaac aggctactac ctgaccttgg tcaagaaaga tgtggaatcc tccctcagtt 3360 

cctgcagaaa cagtagtagc actgtgtcat acctgaaaaa ggaggacagt gtttctcaga 3420 

gcagttctga tgctggcctg ggcagcgacc atgagagtga cacgctgacc atcgatgtct 3480 

ctgctatctc caacctcatc aggaagcatg tgtctgaagc ccggctggtg gaagacatag 3540 

ggcatgagct gacctatgtg ctgccatatg aagctgctaa ggagggagcc tttgtggaac 3600 

tctttcatga gattgatgac cggctctcag acctgggcat ttctagttat ggcatctcag 3660 

agacgaccct ggaagaaata ttcctcaagg tggccgaaga gagtggggtg gatgctgaga 3720 

cctcagatgg taccttgcca gcaagacgaa acaggcgggc cttcggggac aagcagagct 3780 

gtcttcgccc gttcactgaa gatgatgctg ctgatccaaa tgattctgac atagacccag 3840 

aatccagaga gacagacttg ctcagtggga tggatggcaa agggtcctac caggtgaaag 3900 

gctggaaact tacacagcaa cagtttgtgg cccttttgtg gaagagactg ctaattgcca 3960 

gacggagtcg gaaaggattt tttgctcaga ttgtcttgcc agctgtgttt gtctgcattg 4020 

cccttgtgtt cagcctgatc gtgccaccct ttggcaagta ccccagcctg gaacttcagc 4080 

cctggatgta caacgaacag tacacatttg tcagcaatga tgctcctgag gacacgggaa 4140 

ccctggaact cttaaacgcc ctcaccaaag accctggctt cgggacccgc tgtatggaag 4200 

gaaacccaat cccagacacg ccctgccagg caggggagga agagtggacc actgccccag 4260 

ttccccagac catcatggac ctcttccaga atgggaactg gacaatgcag aacccttcac 4320 

ctgcatgcca gtgtagcagc gacaaaatca agaagatgct gcctgtgtgt cccccagggg 4380 

caggggggct gcctcctcca caaagaaaac aaaacactgc agatatcctt caggacctga 4440 

caggaagaaa catttcggat tatctggtga agacgtatgt gcagatcata gccaaaagct 4500 

taaagaacaa gatctgggtg aatgagttta ggtatggcgg cttttccctg ggtgtcagta 4560 

atactcaagc acttcctccg agtcaagaag ttaatgatgc catcaaacaa atgaagaaac 4620 

acctaaagct ggccaaggac agttctgcag atcgatttct caacagcttg ggaagattta 4680 

tgacaggact ggacaccaaa aataatgtca aggtgtggtt caataacaag ggctggcatg 4740 

caatcagctc tttcctgaat gtcatcaaca atgccattct ccgggccaac ctgcaaaagg 4800 

gagagaaccc tagccattat ggaattactg ctttcaatca tcccctgaat ctcaccaagc 4860 

agcagctctc agaggtggct cggatgacca catcagtgga tgtccttgtg tccatctgtg 4920 

tcatctttgc aatgtccttc gtcccagcca gctttgtcgt attcctgatc caggagcggg 4980 

tcagcaaagc aaaacacctg cagttcatca gtggagtga agc ctg tca tct act 5034 
Ser Leu Ser Ser Thr 
1 5 

ggc tct cta att ttg tct ggg ata tgt gca att aag ttg ttt cca ann 5082 
Gly Ser Leu Ile Leu Ser Gly Ile Cys Ala Ile Lys Leu Phe Pro Xaa 
10 15 20 

nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 5130 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
25 30 35 

nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 5178 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
40 45 50 

nta atc ttt cct ttt cag tgc ttt ggg ctc ctg gga gtt aat ggg gct 5226 
Xaa Ile Phe Pro Phe Gln Cys Phe Gly Leu Leu Gly Val Asn Gly Ala 
55 60 65 

gga aaa tca tca act ttc aag atg tta aca gga gat acc act gtt acc 5274 
Gly Lys Ser Ser Thr Phe Lys Met Leu Thr Gly Asp Thr Thr Val Thr 
70 75 80 85 

aga gga gat gct ttc ctt aac att tgc agt atc tta tca aac atc cat 5322 
Arg Gly Asp Ala Phe Leu Asn Ile Cys Ser Ile Leu Ser Asn Ile His 
90 95 100 

gaa gta cat cag aac atg ggc tac tgc cct cag ttt gat gcc atc aca 5370 
Glu Val His Gln Asn Met Gly Tyr Cys Pro Gln Phe Asp Ala Ile Thr 
105 110 115 

gag ctg ttg act ggg aga gaa cac gtg gag ttc ttt gcc ctt ttg aga 5418 
Glu Leu Leu Thr Gly Arg Glu His Val Glu Phe Phe Ala Leu Leu Arg 
120 125 130 

gga gtc cca gag aaa gaa gtt ggc aag gtt ggt gag tgg gcg att cgg 5466 
Gly Val Pro Glu Lys Glu Val Gly Lys Val Gly Glu Trp Ala Ile Arg 
135 140 145 

aaa ctg ggc ctc gtg aag tat gga gaa aaa tat gct ggt aac tat agt 5514 
Lys Leu Gly Leu Val Lys Tyr Gly Glu Lys Tyr Ala Gly Asn Tyr Ser 
150 155 160 165 

gga ggc aac aaa cgc aag ctc tct aca gcc atg gct ttg atc ggc ggg 5562 
Gly Gly Asn Lys Arg Lys Leu Ser Thr Ala Met Ala Leu Ile Gly Gly 
170 175 180 

cct cct gtg gtg ttt ctg gat gaa ccc acc aca ggc atg gat ccc aaa 5610 
Pro Pro Val Val Phe Leu Asp Glu Pro Thr Thr Gly Met Asp Pro Lys 
185 190 195 

gcc cgg cgg ttc ttg tgg aat tgt gcc cta agt gtt gtc aag gag ggg 5658 
Ala Arg Arg Phe Leu Trp Asn Cys Ala Leu Ser Val Val Lys Glu Gly 
200 205 210 

aga tca gta gtg ctt aca tct cat agt atg gaa gaa tgt gaa gct ctt 5706 
Arg Ser Val Val Leu Thr Ser His Ser Met Glu Glu Cys Glu Ala Leu 
215 220 225 

tgc act agg atg gca atc atg gtc aat gga agg ttc agg tgc ctt ggc 5754 
Cys Thr Arg Met Ala Ile Met Val Asn Gly Arg Phe Arg Cys Leu Gly 
230 235 240 245 

agt gtc cag cat cta aaa aat agg ttt gga gat ggt tat aca ata gtt 5802 
Ser Val Gln His Leu Lys Asn Arg Phe Gly Asp Gly Tyr Thr Ile Val 
250 255 260 

gta cga ata gca ggg tcc aac ccg gac ctg aag cct gtc cag gat ttc 5850 
Val Arg Ile Ala Gly Ser Asn Pro Asp Leu Lys Pro Val Gln Asp Phe 
265 270 275 

ttt gga ctt gca ttt cct gga agt gtt cta aaa gag aaa cac cgg aac 5898 
Phe Gly Leu Ala Phe Pro Gly Ser Val Leu Lys Glu Lys His Arg Asn 
280 285 290 

atg cta caa tac cag ctt cca tct tca tta tct tct ctg gcc agg ata 5946 
Met Leu Gln Tyr Gln Leu Pro Ser Ser Leu Ser Ser Leu Ala Arg Ile 
295 300 305 

ttc agc atc ctc tcc cag agc aaa aag cga ctc cac ata gaa gac tac 5994 
Phe Ser Ile Leu Ser Gln Ser Lys Lys Arg Leu His Ile Glu Asp Tyr 
310 315 320 325 

tct gtt tct cag aca aca ctt gac caa gta ttt gtg aac ttt gcc aag 6042 
Ser Val Ser Gln Thr Thr Leu Asp Gln Val Phe Val Asn Phe Ala Lys 
330 335 340 

gac caa agt gat gat gac cac tta aaa gac ctc tca tta cac aaa aac 6090 
Asp Gln Ser Asp Asp Asp His Leu Lys Asp Leu Ser Leu His Lys Asn 
345 350 355 

cag aca gta gtg gac gtt gca gtt ctc aca tct ttt cta cag gat gag 6138 
Gln Thr Val Val Asp Val Ala Val Leu Thr Ser Phe Leu Gln Asp Glu 
360 365 370 

aaa gtg aaa gaa agc tat gta tga agaatcctgt tcatacgggg tggctgaaag 6192 
Lys Val Lys Glu Ser Tyr Val * 
375 380 


taaagaggaa ctagactttc ctttgcacca tgtgaagtgt tgtggagaaa agagccagaa 6252 

gttgatgtgg gaagaagtaa actggatact gtactgatac tattcaatgc aatgcaattc 6312 

aatgcaatga aaacaaaatt ccattacagg ggcagtgcct ttgtagccta tgtcttgtat 6372 

ggctctcaag tgaaagactt gaatttagtt ttttacctat acctatgtga aactctatta 6432 

tggaacccaa tggacatatg ggtttgaact cacacttttt tttttttttt tgttcctgtg 6492 

tattctcatt ggggttgcaa caataattca tcaagtaatc atggccagcg attattgatc 6552 

aaaatcaaaa ggtaatgcac atcctcattc actaagccat gccatgccca ggagactggt 6612 

ttcccggtga cacatccatt gctggcaatg agtgtgccag agttattagt gccaagtttt 6672 

tcagaaagtt tgaagcacca tggtgtgtca tgctcacttt tgtgaaagct gctctgctca 6732 

gagtctatca acattgaata tcagttgaca gaatggtgcc atgcgtggct aacatcctgc 6792 

tttgattccc tctgataagc tgttctggtg gcagtaacat gcaacaaaaa tgtgggtgtc 6852 

tccaggcacg ggaaacttgg ttccattgtt atattgtcct atgcttcgag ccatgggtct 6912 

acagggtcat ccttatgaga ctcttaaata tacttagatc ctggtaagag gcaaagaatc 6972 

aacagccaaa ctgctggggc tgcaagctgc tgaagccagg gcatgggatt aaagagattg 7032 

tgcgttcaaa cctagggaag cctgtgccca tttgtcctga ctgtctgcta acatggtaca 7092 

ctgcatctca agatgtttat ctgacacaag tgtattattt ctggcttttt gaattaatct 7152 

agaaaatgaa aagatggagt tgtattttga caaaaatgtt tgtacttttt aatgttattt 7212 

ggaattttaa gttctatcag tgacttctga atccttagaa tggcctcttt gtagaaccct 7272 

gtggtataga ggagtatggc cactgcccca ctatttttat tttcttatgt aagtttgcat 7332 

atcagtcatg actagtgcct agaaagcaat gtgatggtca ggatctcatg acattatatt 7392 

tgagtttctt tcagatcatt taggatactc ttaatctcac ttcatcaatc aaatattttt 7452 

tgagtgtatg ctgtagctga aagagtatgt acgtacgtat aagactagag agatattaag 7512 

tctcagtaca cttcctgtgc catgttattc agctcactgg tttacaaata taggttgtct 7572 

tgtggttgta ggagcccact gtaacaatac tgggcagcct tttttttttt ttttttaatt 7632 

gcaacaatgc aaaagccaag aaagtataag ggtcacaagt ctaaacaatg aattcttcaa 7692 

cagggaaaac agctagcttg aaaacttgct gaaaaacaca acttgtgttt atggcattta 7752 

gtaccttcaa ataattggct ttgcagatat tggatacccc attaaatctg acagtctcaa 7812 

atttttcatc tcttcaatca ctagtcaaga aaaatataaa aacaacaaat acttccatat 7872 

ggagcatttt tcagagtttt ctaacccagt cttatttttc tagtcagtaa acatttgtaa 7932 

aaatactgtt tcactaatac ttactgttaa ctgtcttgag agaaaagaaa aatatgagag 7992 

aactattgtt tggggaagtt caagtgatct ttcaatatca ttactaactt cttccacttt 8052 

ttccagaatt tgaatattaa cgctaaaggt gtaagacttc agatttcaaa ttaatctttc 8112 

tatatttttt aaatttacag aatattatat aacccactgc tgaaaaagaa aaaaatgatt 8172 

gttttagaag ttaaagtcaa tattgatttt aaatataagt aatgaaggca tatttccaat 8232 

aactagtgat atggcatcgt tgcattttac agtatcttca aaaatacaga atttatagaa 8292 

taatttctcc tcatttaata tttttcaaaa tcaaagttat ggtttcctca ttttactaaa 8352 

atcgtattct aattcttcat tatagtaaat ctatgagcaa ctccttactt cggttcctct 8412 

gatttcaagg ccatatttta aaaaatcaaa aggcactgtg aactattttg aagaaaacac 8472 

aacattttaa tacagattga aaggacctct tctgaagcta gaaacaatct atagttatac 8532 

atcttcatta atactgtgtt accttttaaa atagtaattt tttacatttt cctgtgtaaa 8592 

cctaattgtg gtagaaattt ttaccaactc tatactcaat caagcaaaat ttctgtatat 8652 

tccctgtgga atgtacctat gtgagtttca gaaattctca aaatacgtgt tcaaaaattt 8712 

ctgcttttgc atctttggga cacctcagaa aacttattaa caactgtgaa tatgagaaat 8772 

acagaagaaa ataataagcc ctctatacat aaatgcccag cacaattcat tgttaaaaaa 8832 

caaccaaacc tcacactact gtatttcatt atctgtactg aaagcaaatg ctttgtgact 8892 

attaaatgtt gcacatcatt cattcactgt ata 8925 

<210> 28 

<211> 380 

<212> PRT 

<213> Homo sapien 

<220> 

<221> UNSURE 
<222> (21)...(54) 
<223> Xaa = unknown 

<400> 28 

Ser Leu Ser Ser Thr Gly Ser Leu Ile Leu Ser Gly Ile Cys Ala Ile 
1 5 10 15 

Lys Leu Phe Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Ile Phe Pro Phe Gln Cys Phe Gly Leu Leu 
50 55 60 

Gly Val Asn Gly Ala Gly Lys Ser Ser Thr Phe Lys Met Leu Thr Gly 
65 70 75 80 

Asp Thr Thr Val Thr Arg Gly Asp Ala Phe Leu Asn Ile Cys Ser Ile 
85 90 95 

Leu Ser Asn Ile His Glu Val His Gln Asn Met Gly Tyr Cys Pro Gln 
100 105 110 

Phe Asp Ala Ile Thr Glu Leu Leu Thr Gly Arg Glu His Val Glu Phe 
115 120 125 

Phe Ala Leu Leu Arg Gly Val Pro Glu Lys Glu Val Gly Lys Val Gly 
130 135 140 

Glu Trp Ala Ile Arg Lys Leu Gly Leu Val Lys Tyr Gly Glu Lys Tyr 
145 150 155 160 

Ala Gly Asn Tyr Ser Gly Gly Asn Lys Arg Lys Leu Ser Thr Ala Met 
165 170 175 

Ala Leu Ile Gly Gly Pro Pro Val Val Phe Leu Asp Glu Pro Thr Thr 
180 185 190 

Gly Met Asp Pro Lys Ala Arg Arg Phe Leu Trp Asn Cys Ala Leu Ser 
195 200 205 

Val Val Lys Glu Gly Arg Ser Val Val Leu Thr Ser His Ser Met Glu 
210 215 220 

Glu Cys Glu Ala Leu Cys Thr Arg Met Ala Ile Met Val Asn Gly Arg 
225 230 235 240 

Phe Arg Cys Leu Gly Ser Val Gln His Leu Lys Asn Arg Phe Gly Asp 
245 250 255 

Gly Tyr Thr Ile Val Val Arg Ile Ala Gly Ser Asn Pro Asp Leu Lys 
260 265 270 

Pro Val Gln Asp Phe Phe Gly Leu Ala Phe Pro Gly Ser Val Leu Lys 
275 280 285 

Glu Lys His Arg Asn Met Leu Gln Tyr Gln Leu Pro Ser Ser Leu Ser 
290 295 300 

Ser Leu Ala Arg Ile Phe Ser Ile Leu Ser Gln Ser Lys Lys Arg Leu 
305 310 315 320 

His Ile Glu Asp Tyr Ser Val Ser Gln Thr Thr Leu Asp Gln Val Phe 
325 330 335 

Val Asn Phe Ala Lys Asp Gln Ser Asp Asp Asp His Leu Lys Asp Leu 
340 345 350 

Ser Leu His Lys Asn Gln Thr Val Val Asp Val Ala Val Leu Thr Ser 
355 360 365 

Phe Leu Gln Asp Glu Lys Val Lys Glu Ser Tyr Val 
370 375 380 

<210> 29 

<211> 897 

<212> DNA 

<213> Homo sapien 

<220> 

<221> CDS 

<222> (39)...(842) 

<223> Nucleotide sequence encoding apolipoprotein A-I (APOA1) 

<400> 29 

agagactgcg agaaggaggt cccccacggc ccttcagg atg aaa gct gcg gtg ctg 56 
Met Lys Ala Ala Val Leu 
1 5 

acc ttg gcc gtg ctc ttc ctg acg ggg agc cag gct cgg cat ttc tgg 104 
Thr Leu Ala Val Leu Phe Leu Thr Gly Ser Gln Ala Arg His Phe Trp 
10 15 20 

cag caa gat gaa ccc ccc cag agc ccc tgg gat cga gtg aag gac ctg 152 
Gln Gln Asp Glu Pro Pro Gln Ser Pro Trp Asp Arg Val Lys Asp Leu 
25 30 35 

gcc act gtg tac gtg gat gtg ctc aaa gac agc ggc aga gac tat gtg 200 
Ala Thr Val Tyr Val Asp Val Leu Lys Asp Ser Gly Arg Asp Tyr Val 
40 45 50 

tcc cag ttt gaa ggc tcc gcc ttg gga aaa cag cta aac cta aag ctc 248 
Ser Gln Phe Glu Gly Ser Ala Leu Gly Lys Gln Leu Asn Leu Lys Leu 
55 60 65 70 

ctt gac aac tgg gac agc gtg acc tcc acc ttc agc aag ctg cgc gaa 296 
Leu Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg Glu 
75 80 85 

cag ctc ggc cct gtg acc cag gag ttc tgg gat aac ctg gaa aag gag 344 
Gln Leu Gly Pro Val Thr Gln Glu Phe Trp Asp Asn Leu Glu Lys Glu 
90 95 100 

aca gag ggc ctg agg cag gag atg agc aag gat ctg gag gag gtg aag 392 
Thr Glu Gly Leu Arg Gln Glu Met Ser Lys Asp Leu Glu Glu Val Lys 
105 110 115 

gcc aag gtg cag ccc tac ctg gac gac ttc cag aag aag tgg cag gag 440 
Ala Lys Val Gln Pro Tyr Leu Asp Asp Phe Gln Lys Lys Trp Gln Glu 
120 125 130 

gag atg gag ctc tac cgc cag aag gtg gag ccg ctg cgc gca gag ctc 488 
Glu Met Glu Leu Tyr Arg Gln Lys Val Glu Pro Leu Arg Ala Glu Leu 
135 140 145 150 

caa gag ggc gcg cgc cag aag ctg cac gag ctg caa gag aag ctg agc 536 
Gln Glu Gly Ala Arg Gln Lys Leu His Glu Leu Gln Glu Lys Leu Ser 
155 160 165 

cca ctg ggc gag gag atg cgc gac cgc gcg cgc gcc cat gtg gac gcg 584 
Pro Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala His Val Asp Ala 
170 175 180 

ctg cgc acg cat ctg gcc ccc tac agc gac gag ctg cgc cag cgc ttg 632 
Leu Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu Arg Gln Arg Leu 
185 190 195 

gcc gcg cgc ctt gag gct ctc aag gag aac ggc ggc gcc aga ctg gcc 680 
Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly Ala Arg Leu Ala 
200 205 210 

gag tac cac gcc aag gcc acc gag cat ctg agc acg ctc agc gag aag 728 
Glu Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr Leu Ser Glu Lys 
215 220 225 230 

gcc aag ccc gcg ctc gag gac ctc cgc caa ggc ctg ctg ccc gtg ctg 776 
Ala Lys Pro Ala Leu Glu Asp Leu Arg Gln Gly Leu Leu Pro Val Leu 
235 240 245 

gag agc ttc aag gtc agc ttc ctg agc gct ctc gag gag tac act aag 824 
Glu Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu Glu Tyr Thr Lys 
250 255 260 

aag ctc aac acc cag tga ggcgcccgcc gccgcccccc ttcccggtgc 872 
Lys Leu Asn Thr Gln * 
265 

tcagaataaa cgtttccaaa gtggg 897 

<210> 30 

<211> 267 

<212> PRT 

<213> Homo sapien 

<400> 30 

Met Lys Ala Ala Val Leu Thr Leu Ala Val Leu Phe Leu Thr Gly Ser 
1 5 10 15 

Gln Ala Arg His Phe Trp Gln Gln Asp Glu Pro Pro Gln Ser Pro Trp 
20 25 30 

Asp Arg Val Lys Asp Leu Ala Thr Val Tyr Val Asp Val Leu Lys Asp 
35 40 45 

Ser Gly Arg Asp Tyr Val Ser Gln Phe Glu Gly Ser Ala Leu Gly Lys 
50 55 60 

Gln Leu Asn Leu Lys Leu Leu Asp Asn Trp Asp Ser Val Thr Ser Thr 
65 70 75 80 

Phe Ser Lys Leu Arg Glu Gln Leu Gly Pro Val Thr Gln Glu Phe Trp 
85 90 95 

Asp Asn Leu Glu Lys Glu Thr Glu Gly Leu Arg Gln Glu Met Ser Lys 
100 105 110 

Asp Leu Glu Glu Val Lys Ala Lys Val Gln Pro Tyr Leu Asp Asp Phe 
115 120 125 

Gln Lys Lys Trp Gln Glu Glu Met Glu Leu Tyr Arg Gln Lys Val Glu 
130 135 140 

Pro Leu Arg Ala Glu Leu Gln Glu Gly Ala Arg Gln Lys Leu His Glu 
145 150 155 160 

Leu Gln Glu Lys Leu Ser Pro Leu Gly Glu Glu Met Arg Asp Arg Ala 
165 170 175 

Arg Ala His Val Asp Ala Leu Arg Thr His Leu Ala Pro Tyr Ser Asp 
180 185 190 

Glu Leu Arg Gln Arg Leu Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn 
195 200 205 

Gly Gly Ala Arg Leu Ala Glu Tyr His Ala Lys Ala Thr Glu His Leu 
210 215 220 

Ser Thr Leu Ser Glu Lys Ala Lys Pro Ala Leu Glu Asp Leu Arg Gln 
225 230 235 240 

Gly Leu Leu Pro Val Leu Glu Ser Phe Lys Val Ser Phe Leu Ser Ala 
245 250 255 

Leu Glu Glu Tyr Thr Lys Lys Leu Asn Thr Gln 
260 265 

<210> 31 

<211> 14121 

<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (129)...(13820) 

<223> Nucleotide sequence encoding apolipoprotein B (APOB) 

<400> 31 

attcccaccg ggacctgcgg ggctgagtgc ccttctcggt tgctgccgct gaggagcccg 60 
cccagccagc cagggccgcg aggccgaggc caggccgcag cccaggagcc gccccaccgc 120 
agctggcg atg gac ccg ccg agg ccc gcg ctg ctg gcg ctg ctg gcg ctg 170 
Met Asp Pro Pro Arg Pro Ala Leu Leu Ala Leu Leu Ala Leu 
1 5 10 

cct gcg ctg ctg ctg ctg ctg ctg gcg ggc gcc agg gcc gaa gag gaa 218 
Pro Ala Leu Leu Leu Leu Leu Leu Ala Gly Ala Arg Ala Glu Glu Glu 
15 20 25 30 

atg ctg gaa aat gtc agc ctg gtc tgt cca aaa gat gcg acc cga ttc 266 
Met Leu Glu Asn Val Ser Leu Val Cys Pro Lys Asp Ala Thr Arg Phe 
35 40 45 

aag cac ctc cgg aag tac aca tac aac tat gag gct gag agt tcc agt 314 
Lys His Leu Arg Lys Tyr Thr Tyr Asn Tyr Glu Ala Glu Ser Ser Ser 
50 55 60 

gga gtc cct ggg act gct gat tca aga agt gcc acc agg atc aac tgc 362 
Gly Val Pro Gly Thr Ala Asp Ser Arg Ser Ala Thr Arg Ile Asn Cys 
65 70 75 

aag gtt gag ctg gag gtt ccc cag ctc tgc agc ttc atc ctg aag acc 410 
Lys Val Glu Leu Glu Val Pro Gln Leu Cys Ser Phe Ile Leu Lys Thr 
80 85 90 


agc cag tgc acc ctg aaa gag gtg tat ggc ttc aac cct gag ggc aaa 458 
Ser Gln Cys Thr Leu Lys Glu Val Tyr Gly Phe Asn Pro Glu Gly Lys 
95 100 105 110 

gcc ttg ctg aag aaa acc aag aac tct gag gag ttt gct gca gcc atg 506 
Ala Leu Leu Lys Lys Thr Lys Asn Ser Glu Glu Phe Ala Ala Ala Met 
115 120 125 

tcc agg tat gag ctc aag ctg gcc att cca gaa ggg aag cag gtt ttc 554 
Ser Arg Tyr Glu Leu Lys Leu Ala Ile Pro Glu Gly Lys Gln Val Phe 
130 135 140 

ctt tac ccg gag aaa gat gaa cct act tac atc ctg aac atc aag agg 602 
Leu Tyr Pro Glu Lys Asp Glu Pro Thr Tyr Ile Leu Asn Ile Lys Arg 
145 150 155 

ggc atc att tct gcc ctc ctg gtt ccc cca gag aca gaa gaa gcc aag 650 
Gly Ile Ile Ser Ala Leu Leu Val Pro Pro Glu Thr Glu Glu Ala Lys 
160 165 170 

caa gtg ttg ttt ctg gat acc gtg tat gga aac tgc tcc act cac ttt 698 
Gln Val Leu Phe Leu Asp Thr Val Tyr Gly Asn Cys Ser Thr His Phe 
175 180 185 190 

acc gtc aag acg agg aag ggc aat gtg gca aca gaa ata tcc act gaa 746 
Thr Val Lys Thr Arg Lys Gly Asn Val Ala Thr Glu Ile Ser Thr Glu 
195 200 205 

aaa aac cto ggg cag tgt gat cgc ttc aag ccc atc cgc aca ggc atc 794 
Arg Asp Leu Gly Gln Cys Asp Arg Phe Lys Pro Ile Arg Thr Gly Ile 
210 215 220 

agc cca ctt gct ctc atc aaa ggc atg acc cgc ccc ttg tca act ctg 842 
Ser Pro Leu Ala Leu Ile Lys Gly Met Thr Arg Pro Leu Ser Thr Leu 
225 230 235 

atc agc agc agc cag tcc tgt cag tac aca ctg gac gct aag agg aag 890 
Ile Ser Ser Ser Gln Ser Cys Gln Tyr Thr Leu Asp Ala Lys Arg Lys 
240 245 250 

cat gtg gca gaa gcc atc tgc aag gag caa cac ctc ttc ctg cct ttc 938 
His Val Ala Glu Ala Ile Cys Lys Glu Gln His Leu Phe Leu Pro Phe 
255 260 265 270 

tcc tac aac aat aag tat ggg atg gta gca caa gtg aca cag act ttg 986 
Ser Tyr Asn Asn Lys Tyr Gly Met Val Ala Gln Val Thr Gln Thr Leu 
275 280 285 

aaa ctt gaa gac aca cca aag atc aac agc cgc ttc ttt ggt gaa ggt 1034 
Lys Leu Glu Asp Thr Pro Lys Ile Asn Ser Arg Phe Phe Gly Glu Gly 
290 295 300 

act aag aag atg ggc ctc gca ttt gag agc acc aaa tcc aca tca cct 1082 
Thr Lys Lys Met Gly Leu Ala Phe Glu Ser Thr Lys Ser Thr Ser Pro 
305 310 315 




cca aag cag gcc gaa gct gtt ttg aag act ctc cag gaa ctg aaa aaa 1130 
Pro Lys Gln Ala Glu Ala Val Leu Lys Thr Leu Gln Glu Leu Lys Lys 
320 325 330 

cta acc atc tct gag caa aat atc cag aga gct aat ctc ttc aat aag 1178 
Leu Thr Ile Ser Glu Gln Asn Ile Gln Arg Ala Asn Leu Phe Asn Lys 
335 340 345 350 

ctg gtt act gag ctg aga ggc ctc agt gat gaa gca gtc aca tct ctc 1226 
Leu Val Thr Glu Leu Arg Gly Leu Ser Asp Glu Ala Val Thr Ser Leu 
355 360 365 

ttg cca cag ctg att gag gtg tcc agc ccc atc act tta caa gcc ttg 1274 
Leu Pro Gln Leu Ile Glu Val Ser Ser Pro Ile Thr Leu Gln Ala Leu 
370 375 380 

gtt cag tgt gga cag cct cag tgc tcc act cac atc ctc cag tgg ctg 1322 
Val Gln Cys Gly Gln Pro Gln Cys Ser Thr His Ile Leu Gln Trp Leu 
385 390 395 

aaa cat gtg cat gcc aac ccc ctt ctg ata gat gtg gtc acc tac ctg 1370 
Lys Arg Val His Ala Asn Pro Leu Leu Ile Asp Val Val Thr Tyr Leu 
400 405 410 

gtg gcc ctg atc ccc gag ccc tca gca cag cag ctg cga gag atc ttc 1418 
Val Ala Leu Ile Pro Glu Pro Ser Ala Gln Gln Leu Arg Glu Ile Phe 
415 420 425 430 

aac atg gcg agg gat cag cgc agc cga gcc acc ttg tat gcg ctg agc 1466 
Asn Met Ala Arg Asp Gln Arg Ser Arg Ala Thr Leu Tyr Ala Leu Ser 
435 440 445 

cac gcg gtc aac aac tat cat aag aca aac cct aca ggg acc cag gag 1514 
His Ala Val Asn Asn Tyr His Lys Thr Asn Pro Thr Gly Thr Gln Glu 
450 455 460 

ctg ctg gac att gct aat tac ctg atg gaa cag att caa gat gac tgc 1562 
Leu Leu Asp Ile Ala Asn Tyr Leu Met Glu Gln Ile Gln Asp Asp Cys 
465 470 475 

act ggg gat gaa gat tac acc tat ttg att ctg cgg gtc att gga aat 1610 
Thr Gly Asp Glu Asp Tyr Thr Tyr Leu Ile Leu Arg Val Ile Gly Asn 
480 485 490 

atg ggc caa acc atg gag cag tta act cca gaa ctc aag tct tca atc 1658 
Met Gly Gln Thr Met Glu Gln Leu Thr Pro Glu Leu Lys Ser Ser Ile 
495 500 505 510 

ctc aaa tgt gtc caa agt aca aag cca tca ctg atg atc cag aaa gct 1706 
Leu Lys Cys Val Gln Ser Thr Lys Pro Ser Leu Met Ile Gln Lys Ala 
515 520 525 

gcc atc cag gct ctg cgg aaa atg gag cct aaa gac aag gac cag gag 1754 
Ala Ile Gln Ala Leu Arg Lys Met Glu Pro Lys Asp Lys Asp Gln Glu 
530 535 540 

gtt ctt ctt cag act ttc ctt gat gat gct tct ccg gga gat aag cga 1802 
Val Leu Leu Gln Thr Phe Leu Asp Asp Ala Ser Pro Gly Asp Lys Arg 
545 550 555 

ctg gct gcc tat ctt atg ttg atg agg agt cct tca cag gca gat att 1850 
Leu Ala Ala Tyr Leu Met Leu Met Arg Ser Pro Ser Gln Ala Asp Ile 
560 565 570 

aac aaa att gtc caa att cta cca tgg gaa cag aat gag caa gtg aag 1898 
Asn Lys Ile Val Gln Ile Leu Pro Trp Glu Gln Asn Glu Gln Val Lys 
575 580 585 590 

aac ttt gtg gct tcc cat att gcc aat atc ttg aac tca gaa gaa ttg 1946 
Asn Phe Val Ala Ser His Ile Ala Asn Ile Leu Asn Ser Glu Glu Leu 
595 600 605 

gat atc caa gat ctg aaa aag tta gtg aaa gaa gct ctg aaa gaa tct 1994 
Asp Ile Gln Asp Leu Lys Lys Leu Val Lys Glu Ala Leu Lys Glu Ser 
610 615 620 

caa ctt cca act gtc atg gac ttc aga aaa ttc tct cgg aac tat caa 2042 
Gln Leu Pro Thr Val Met Asp Phe Arg Lys Phe Ser Arg Asn Tyr Gln 
625 630 635 

ctc tac aaa tct gtt tct ctt cca tca ctt gac cca gcc tca gcc aaa 2090 
Leu Tyr Lys Ser Val Ser Leu Pro Ser Leu Asp Pro Ala Ser Ala Lys 
640 645 650 

ata gaa ggg aat ctt ata ttt gat cca aat aac tac ctt cct aaa gaa 2138 
Ile Glu Gly Asn Leu Ile Phe Asp Pro Asn Asn Tyr Leu Pro Lys Glu 
655 660 665 670 

agc atg ctg aaa act acc ctc act gcc ttt gga ttt gct tca gct gac 2186 
Ser Met Leu Lys Thr Thr Leu Thr Ala Phe Gly Phe Ala Ser Ala Asp 
675 680 685 

ctc atc gag att ggc ttg gaa gga aaa ggc ttt gag cca aca ttg gaa 2234 
Leu Ile Glu Ile Gly Leu Glu Gly Lys Gly Phe Glu Pro Thr Leu Glu 
690 695 700 

gct ctt ttt ggg aag caa gga ttt ttc cca gac agt gtc aac aaa gct 2282 
Ala Leu Phe Gly Lys Gln Gly Phe Phe Pro Asp Ser Val Asn Lys Ala 
705 710 715 

ttg tac tgg gtt aat ggt caa gtt cct gat ggt gtc tct aag gtc tta 2330 
Leu Tyr Trp Val Asn Gly Gln Val Pro Asp Gly Val Ser Lys Val Leu 
720 725 730 

gtg gac cac ttt ggc tat acc aaa gat gat aaa cat gag cag gat atg 2378 
Val Asp His Phe Gly Tyr Thr Lys Asp Asp Lys His Glu Gln Asp Met 
735 740 745 750 

gta aat gga ata atg ctc agt gtt gag aag ctg att aaa gat ttg aaa 2426 
Val Asn Gly Ile Met Leu Ser Val Glu Lys Leu Ile Lys Asp Leu Lys 
755 760 765 

tcc aaa gaa gtc ccg gaa gcc aga gcc tac ctc cgc atc ttg gga gag 2474 
Ser Lys Glu Val Pro Glu Ala Arg Ala Tyr Leu Arg Ile Leu Gly Glu 
770 775 780 

gag ctt ggt ttt gcc agt ctc cat gac ctc cag ctc ctg gga aag ctg 2522 
Glu Leu Gly Phe Ala Ser Leu His Asp Leu Gln Leu Leu Gly Lys Leu 
785 790 795 


ctt ctg atg ggt gcc cgc act ctg cag ggg ate ccc cag atg att gga 
Leu Leu Met Gly Ala Arg Thr Leu Gin Gly lie Pro Gin Met lie Gly 
800 80S 810 


ttc atg gag aat gcc ttt gaa etc ccc act gga get gga tta cag ttg 
Phe Met Glu Asn Ala Phe Glu Leu Pro Thr Gly Ala Gly Leu Gin Leu 
835 840 845 


cct aca gga gag att gag cag tat tct gtc age gca acc tat gag etc 
Pro Thr Gly Glu He Glu Gin Tyr Ser Val Ser Ala Thr Tyr Glu Leu 


2570 


gag gtc ate agg aag ggc tea aag aat gac ttt ttt ctt cac tac ate 2618 
Glu Val He Arg Lys Gly Ser Lys Asn Asp Phe Phe Leu His Tyr He 
815 820 825 830 


2666 


caa ata tct tea tct gga gtc att get ccc gga gcc aag get gga gta 2714 
Gin He Ser Ser Ser Gly Val He Ala Pro Gly Ala Lys Ala Gly Val 
850 855 860 

aaa ctg gaa gta gcc aac atg cag get gaa ctg gtg gca aaa ccc tec 2762 
Lys Leu Glu Val Ala Asn Met Gin Ala Glu Leu Val Ala Lys Pro Ser 
865 870 875 

gtg tct gtg gag ttt gtg aca aat atg ggc ate ate att ccg gac ttc 2810 
Val Ser Val Glu Phe Val Thr Asn Met Gly He He He Pro Asp Phe 
880 885 890 

get agg agt ggg gtc cag atg aac acc aac ttc ttc cac gag teg ggt 2858 
Ala Arg Ser Gly Val Gin Met Asn Thr Asn Phe Phe His Glu Ser Gly 
895 900 905 910 

ctg gag get cat gtt gcc eta aaa get ggg aag ctg aag ttt ate att 2906 
Leu Glu Ala His Val Ala Leu Lys Ala Gly Lys Leu Lys Phe He He 
915 920 925 

cct tec cca aag aga cca gtc aag ctg etc agt gga ggc aac aca tta 
Pro Ser Pro Lys Arg Pro Val Lys Leu Leu Ser Gly Gly Asn Thr Leu 
930 935 940 

cat ttg gtc tct acc acc aaa acg gag gtg ate cca cct etc att gag 
His Leu Val Ser Thr Thr Lys Thr Glu Val He Pro Pro Leu He Glu 
945 950 955 

aac agg cag tec tgg tea gtt tgc aag caa gtc ttt cct ggc ctg aat 
Asn Arg Gin Ser Trp Ser Val Cys Lys Gin Val Phe Pro Gly Leu Asn 
960 965 970 

tac tgc acc tea ggc get tac tec aac gcc age tec aca gac tec gcc 
Tyr Cys Thr Ser Gly Ala Tyr Ser Asn Ala Ser Ser Thr Asp Ser Ala 
975 980 985 990 

tec tac tat ccg ctg acc ggg gac acc aga tta gag ctg gaa ctg agg 3146 
Ser Tyr Tyr Pro Leu Thr Gly Asp Thr Arg Leu Glu Leu Glu Leu Arg 
995 1000 1005 


2954 


3002 


3050 


3098 


3194 


1010 1015 1020 

cag aga gag gac aga gcc ttg gtg gat acc ctg aag ttt gta act caa 3242
Gln Arg Glu Asp Arg Ala Leu Val Asp Thr Leu Lys Phe Val Thr Gln
1025 1030 1035

gca gaa ggt gcg aag cag act gag gct acc atg aca ttc aaa tat aat 3290
Ala Glu Gly Ala Lys Gln Thr Glu Ala Thr Met Thr Phe Lys Tyr Asn
1040 1045 1050

cgg cag agt atg acc ttg tcc agt gaa gtc caa att ccg gat ttt gat 3338
Arg Gln Ser Met Thr Leu Ser Ser Glu Val Gln Ile Pro Asp Phe Asp
1055 1060 1065 1070

gtt gac ctc gga aca atc ctc aga gtt aat gat gaa tct act gag ggc 3386
Val Asp Leu Gly Thr Ile Leu Arg Val Asn Asp Glu Ser Thr Glu Gly
1075 1080 1085

aaa acg tct tac aga ctc acc ctg gac att cag aac aag aaa att act 3434
Lys Thr Ser Tyr Arg Leu Thr Leu Asp Ile Gln Asn Lys Lys Ile Thr
1090 1095 1100


gag gtc gcc ctc atg ggc cac cta agt tgt gac aca aag gaa gaa aga 3482
Glu Val Ala Leu Met Gly His Leu Ser Cys Asp Thr Lys Glu Glu Arg
1105 1110 1115

aaa atc aag ggt gtt att tcc ata ccc cgt ttg caa gca gaa gcc aga 3530
Lys Ile Lys Gly Val Ile Ser Ile Pro Arg Leu Gln Ala Glu Ala Arg
1120 1125 1130

agt gag atc ctc gcc cac tgg tcg cct gcc aaa ctg ctt ctc caa atg 3578
Ser Glu Ile Leu Ala His Trp Ser Pro Ala Lys Leu Leu Leu Gln Met
1135 1140 1145 1150

gac tca tct gct aca gct tat ggc tcc aca gtt tcc aag agg gtg gca 3626
Asp Ser Ser Ala Thr Ala Tyr Gly Ser Thr Val Ser Lys Arg Val Ala
1155 1160 1165

tgg cat tat gat gaa gag aag att gaa ttt gaa tgg aac aca ggc acc 3674
Trp His Tyr Asp Glu Glu Lys Ile Glu Phe Glu Trp Asn Thr Gly Thr
1170 1175 1180

aat gta gat acc aaa aaa atg act tcc aat ttc cct gtg gat ctc tcc 3722
Asn Val Asp Thr Lys Lys Met Thr Ser Asn Phe Pro Val Asp Leu Ser
1185 1190 1195

gat tat cct aag agc ttg cat atg tat gct aat aga ctc ctg gat cac 3770
Asp Tyr Pro Lys Ser Leu His Met Tyr Ala Asn Arg Leu Leu Asp His
1200 1205 1210

act ttc cgg cac gtg ggt tcc aaa tta 3818
Thr Phe Arg His Val Gly Ser Lys Leu
1215 1220 1225 1230

ctt cag aag gca tct ggg agt ctt cct
Leu Gln Lys Ala Ser Gly Ser Leu Pro
1235 1240 1245

tat acc cag act ttg caa gac cac ctc aat agc ctg aag gag ttc aac 3914
Tyr Thr Gln Thr Leu Gln Asp His Leu Asn Ser Leu Lys Glu Phe Asn
1250 1255 1260

ctc cag aac atg gga ttg cca gac ttc cac atc cca gaa aac ctc ttc 3962
Leu Gln Asn Met Gly Leu Pro Asp Phe His Ile Pro Glu Asn Leu Phe
1265 1270 1275

tta aaa agc gat ggc cgg gtc aaa tat acc ttg aac aag aac agt ttg 4010
Leu Lys Ser Asp Gly Arg Val Lys Tyr Thr Leu Asn Lys Asn Ser Leu
1280 1285 1290

aaa att gag att cct ttg cct ttt ggt ggc aaa tcc tcc aga gat cta 4058
Lys Ile Glu Ile Pro Leu Pro Phe Gly Gly Lys Ser Ser Arg Asp Leu
1295 1300 1305 1310

aag atg tta gag act gtt agg aca cca gcc ctc cac ttc aag tct gtg 4106
Lys Met Leu Glu Thr Val Arg Thr Pro Ala Leu His Phe Lys Ser Val
1315 1320 1325

gga ttc cat ctg cca tct cga gag ttc caa gtc cct act ttt acc att 4154
Gly Phe His Leu Pro Ser Arg Glu Phe Gln Val Pro Thr Phe Thr Ile
1330 1335 1340

ccc aag ttg tat caa ctg caa gtg cct ctc ctg ggt gtt cta gac ctc 4202
Pro Lys Leu Tyr Gln Leu Gln Val Pro Leu Leu Gly Val Leu Asp Leu
1345 1350 1355

tcc acg aat gtc tac agc aac ttg tac aac tgg tcc gcc tcc tac agt 4250
Ser Thr Asn Val Tyr Ser Asn Leu Tyr Asn Trp Ser Ala Ser Tyr Ser
1360 1365 1370

ggt ggc aac acc agc aca gac cat ttc agc ctt cgg gct cgt tac cac 4298
Gly Gly Asn Thr Ser Thr Asp His Phe Ser Leu Arg Ala Arg Tyr His
1375 1380 1385 1390

atg aag gct gac tct gtg gtt gac ctg ctt tcc tac aat gtg caa gga 4346
Met Lys Ala Asp Ser Val Val Asp Leu Leu Ser Tyr Asn Val Gln Gly
1395 1400 1405

tct gga gaa aca aca tat gac cac aag aat acg ttc aca cta tca tgt 4394
Ser Gly Glu Thr Thr Tyr Asp His Lys Asn Thr Phe Thr Leu Ser Cys
1410 1415 1420

gat ggg tct cta cgc cac aaa ttt cta gat tcg aat atc aaa ttc agt 4442
Asp Gly Ser Leu Arg His Lys Phe Leu Asp Ser Asn Ile Lys Phe Ser
1425 1430 1435

cat gta gaa aaa ctt gga aac aac cca gtc tca aaa ggt tta cta ata 4490
His Val Glu Lys Leu Gly Asn Asn Pro Val Ser Lys Gly Leu Leu Ile
1440 1445 1450

ttc gat gca tct agt tcc tgg gga cca cag atg tct gct tca gtt cat 4538
Phe Asp Ala Ser Ser Ser Trp Gly Pro Gln Met Ser Ala Ser Val His
1455 1460 1465 1470

ttg gac tcc aaa aag aaa cag cat ttg ttt gtc aaa gaa gtc aag att 4586
Leu Asp Ser Lys Lys Lys Gln His Leu Phe Val Lys Glu Val Lys Ile
1475 1480 1485

gat ggg cag ttc aga gtc tct tcg ttc tat gct aaa ggc aca tat ggc 4634
Asp Gly Gln Phe Arg Val Ser Ser Phe Tyr Ala Lys Gly Thr Tyr Gly
1490 1495 1500

ctg tct tgt cag agg gat cct aac act ggc cgg ctc aat gga gag tcc 4682
Leu Ser Cys Gln Arg Asp Pro Asn Thr Gly Arg Leu Asn Gly Glu Ser
1505 1510 1515

aac ctg agg ttt aac tcc tcc tac ctc caa ggc acc aac cag ata aca
Asn Leu Arg Phe Asn Ser Ser Tyr Leu Gln Gly Thr Asn Gln Ile Thr
1520 1525 1530

gga aga tat gaa gat gga acc ctc tcc ctc acc tcc acc tct gat ctg 4778
Gly Arg Tyr Glu Asp Gly Thr Leu Ser Leu Thr Ser Thr Ser Asp Leu
1535 1540 1545 1550

caa agt ggc atc att aaa aat act gct tcc cta aag tat gag aac tac 4826
Gln Ser Gly Ile Ile Lys Asn Thr Ala Ser Leu Lys Tyr Glu Asn Tyr
1555 1560 1565

gag ctg act tta aaa tct gac acc aat ggg aag tat aag aac ttt gcc
Glu Leu Thr Leu Lys Ser Asp Thr Asn Gly Lys Tyr Lys Asn Phe Ala
1570 1575 1580

act tct aac aag atg gat atg acc ttc tct aag caa aat gca ctg ctg
Thr Ser Asn Lys Met Asp Met Thr Phe Ser Lys Gln Asn Ala Leu Leu
1585 1590 1595

cgt tct gaa tat cag gct gat tac gag tca ttg agg ttc ttc agc ctg
Arg Ser Glu Tyr Gln Ala Asp Tyr Glu Ser Leu Arg Phe Phe Ser Leu
1600 1605 1610

ctt tct gga tca cta aat tcc cat ggt ctt gag tta aat gct gac atc
Leu Ser Gly Ser Leu Asn Ser His Gly Leu Glu Leu Asn Ala Asp Ile
1615 1620 1625 1630

tta ggc act gac aaa att aat agt ggt gct cac aag gcg aca cta agg
Leu Gly Thr Asp Lys Ile Asn Ser Gly Ala His Lys Ala Thr Leu Arg
1635 1640 1645

att ggc caa gat gga ata tct acc agt gca acg acc aac ttg aag tgt 5114
Ile Gly Gln Asp Gly Ile Ser Thr Ser Ala Thr Thr Asn Leu Lys Cys
1650 1655 1660

agt ctc ctg gtg ctg gag aat gag ctg aat gca gag ctt ggc ctc tct 5162
Ser Leu Leu Val Leu Glu Asn Glu Leu Asn Ala Glu Leu Gly Leu Ser
1665 1670 1675

ggg gca tct atg aaa tta aca aca aat ggc cgc ttc agg gaa cac aat 5210
Gly Ala Ser Met Lys Leu Thr Thr Asn Gly Arg Phe Arg Glu His Asn
1680 1685 1690

gca aaa ttc agt ctg gat ggg aaa gcc gcc ctc aca gag cta tca ctg 5258
Ala Lys Phe Ser Leu Asp Gly Lys Ala Ala Leu Thr Glu Leu Ser Leu
1695 1700 1705 1710

gga agt gct tat cag gcc atg att ctg ggt gtc gac agc aaa aac att 5306
Gly Ser Ala Tyr Gln Ala Met Ile Leu Gly Val Asp Ser Lys Asn Ile
1715 1720 1725

ttc aac ttc aag gtc agt caa gaa gga ctt aag ctc tca aat gac atg 5354
Phe Asn Phe Lys Val Ser Gln Glu Gly Leu Lys Leu Ser Asn Asp Met
1730 1735 1740

atg ggc tca tat gct gaa atg aaa ttt gac cac aca aac agt ctg aac 5402
Met Gly Ser Tyr Ala Glu Met Lys Phe Asp His Thr Asn Ser Leu Asn
1745 1750 1755

att gca ggc tta tca ctg gac ttc tct tca aaa ctt gac aac att tac 5450
Ile Ala Gly Leu Ser Leu Asp Phe Ser Ser Lys Leu Asp Asn Ile Tyr
1760 1765 1770

agc tct gac aag ttt tat aag caa act gtt aat tta cag cta cag ccc 5498
Ser Ser Asp Lys Phe Tyr Lys Gln Thr Val Asn Leu Gln Leu Gln Pro
1775 1780 1785 1790

tat tct ctg gta act act tta aac agt gac ctg aaa tac aat gct ctg 5546
Tyr Ser Leu Val Thr Thr Leu Asn Ser Asp Leu Lys Tyr Asn Ala Leu
1795 1800 1805

gat ctc acc aac aat ggg aaa cta cgg cta gaa ccc ctg aag ctg cat 5594
Asp Leu Thr Asn Asn Gly Lys Leu Arg Leu Glu Pro Leu Lys Leu His
1810 1815 1820

gtg gct ggt aac cta aaa gga gcc tac caa aat aat gaa ata aaa cac 5642
Val Ala Gly Asn Leu Lys Gly Ala Tyr Gln Asn Asn Glu Ile Lys His
1825 1830 1835

atc tat gcc atc tct tct gct gcc tta tca gca agc tat aaa gca gac 5690
Ile Tyr Ala Ile Ser Ser Ala Ala Leu Ser Ala Ser Tyr Lys Ala Asp
1840 1845 1850

act gtt gct aag gtt cag ggt gtg gag ttt agc cat cgg ctc aac aca 5738
Thr Val Ala Lys Val Gln Gly Val Glu Phe Ser His Arg Leu Asn Thr
1855 1860 1865 1870

gac atc gct ggg ctg gct tca gcc att gac atg agc aca aac tat aat 5786
Asp Ile Ala Gly Leu Ala Ser Ala Ile Asp Met Ser Thr Asn Tyr Asn
1875 1880 1885

tca gac tca ctg cat ttc agc aat gtc ttc cgt tct gta atg gcc ccg 5834
Ser Asp Ser Leu His Phe Ser Asn Val Phe Arg Ser Val Met Ala Pro
1890 1895 1900

ttt acc atg acc atc gat gca cat aca aat ggc aat ggg aaa ctc gct 5882
Phe Thr Met Thr Ile Asp Ala His Thr Asn Gly Asn Gly Lys Leu Ala
1905 1910 1915

ctc tgg gga gaa cat act ggg cag ctg tat agc aaa ttc ctg ttg aaa 5930
Leu Trp Gly Glu His Thr Gly Gln Leu Tyr Ser Lys Phe Leu Leu Lys
1920 1925 1930

gca gaa cct ctg gca ttt act ttc tct cat gat tac aaa ggc tcc aca 5978
Ala Glu Pro Leu Ala Phe Thr Phe Ser His Asp Tyr Lys Gly Ser Thr
1935 1940 1945 1950


agt cat cat ctc gtg tct agg aaa agc atc agt gca gct ctt gaa cac
Ser His His Leu Val Ser Arg Lys Ser Ile Ser Ala Ala Leu Glu His
1955 1960 1965

aaa gtc agt gcc ctg ctt act cca gct gag cag aca ggc acc tgg aaa
Lys Val Ser Ala Leu Leu Thr Pro Ala Glu Gln Thr Gly Thr Trp Lys
1970 1975 1980

ctc aag acc caa ttt aac aac aat gaa tac agc cag gac ttg gat gct
Leu Lys Thr Gln Phe Asn Asn Asn Glu Tyr Ser Gln Asp Leu Asp Ala
1985 1990 1995

tac aac act aaa gat aaa att ggc gtg gag ctt act gga cga act ctg
Tyr Asn Thr Lys Asp Lys Ile Gly Val Glu Leu Thr Gly Arg Thr Leu
2000 2005 2010

gct gac cta act cta cta gac tcc cca att aaa gtg cca ctt tta ctc
Ala Asp Leu Thr Leu Leu Asp Ser Pro Ile Lys Val Pro Leu Leu Leu
2015 2020 2025 2030

agt gag ccc atc aat atc att gat gct tta gag atg aga gat gcc gtt
Ser Glu Pro Ile Asn Ile Ile Asp Ala Leu Glu Met Arg Asp Ala Val
2035 2040 2045

gag aag ccc caa gaa ttt aca att gtt gct ttt gta aag tat gat aaa
Glu Lys Pro Gln Glu Phe Thr Ile Val Ala Phe Val Lys Tyr Asp Lys
2050 2055 2060

aac caa gat gtt cac tcc att aac ctc cca ttt ttt gag acc ttg caa
Asn Gln Asp Val His Ser Ile Asn Leu Pro Phe Phe Glu Thr Leu Gln
2065 2070 2075

gaa tat ttt gag agg aat cga caa acc att ata gtt gta gtg gaa aac
Glu Tyr Phe Glu Arg Asn Arg Gln Thr Ile Ile Val Val Val Glu Asn
2080 2085 2090

gta cag aga aac ctg aag cac atc aat att gat caa ttt gta aga aaa
Val Gln Arg Asn Leu Lys His Ile Asn Ile Asp Gln Phe Val Arg Lys
2095 2100 2105 2110

tac aga gca gcc ctg gga aaa ctc cca cag caa gct aat gat tat ctg
Tyr Arg Ala Ala Leu Gly Lys Leu Pro Gln Gln Ala Asn Asp Tyr Leu
2115 2120 2125

aat tca ttc aat tgg gag aga caa gtt tca cat gcc aag gag aaa ctg
Asn Ser Phe Asn Trp Glu Arg Gln Val Ser His Ala Lys Glu Lys Leu
2130 2135 2140

act gct ctc aca aaa aag tat aga att aca gaa aat gat ata caa att
Thr Ala Leu Thr Lys Lys Tyr Arg Ile Thr Glu Asn Asp Ile Gln Ile
2145 2150 2155

gca tta gat gat gcc aaa atc aac ttt aat gaa aaa cta tct caa ctg
Ala Leu Asp Asp Ala Lys Ile Asn Phe Asn Glu Lys Leu Ser Gln Leu
2160 2165 2170

cag aca Cat atg ata caa ttt gat cag tat att aaa gat agt tat gat 6698
Gln Thr Tyr Met Ile Gln Phe Asp Gln Tyr Ile Lys Asp Ser Tyr Asp
2175 2180 2185 2190

tta cat gat ttg aaa ata gct att gct aat att att gat gaa atc att
Leu His Asp Leu Lys Ile Ala Ile Ala Asn Ile Ile Asp Glu Ile Ile
2195 2200 2205

gaa aaa tta aaa agt ctt gat gag cac tat cat atc cgt gta aat tta
Glu Lys Leu Lys Ser Leu Asp Glu His Tyr His Ile Arg Val Asn Leu
2210 2215 2220

gta aaa aca atc cat gat cta cat ttg ttt att gaa aat att gat ttt
Val Lys Thr Ile His Asp Leu His Leu Phe Ile Glu Asn Ile Asp Phe
2225 2230 2235

aac aaa agt gga agt agt act gca tcc tgg att caa aat gtg gat act
Asn Lys Ser Gly Ser Ser Thr Ala Ser Trp Ile Gln Asn Val Asp Thr
2240 2245 2250

aag tac caa atc aga atc cag ata caa gaa aaa ctg cag cag ctt aag
Lys Tyr Gln Ile Arg Ile Gln Ile Gln Glu Lys Leu Gln Gln Leu Lys
2255 2260 2265 2270

aga cac ata cag aat ata gac atc cag cac cta gct gga aag tta aaa
Arg His Ile Gln Asn Ile Asp Ile Gln His Leu Ala Gly Lys Leu Lys
2275 2280 2285

caa cac att gag gct att gat gtt aga gtg ctt tta gat caa ttg gga
Gln His Ile Glu Ala Ile Asp Val Arg Val Leu Leu Asp Gln Leu Gly
2290 2295 2300

act aca att tca ttt gaa aga ata aat gat gtt ctt gag cat gtc aaa
Thr Thr Ile Ser Phe Glu Arg Ile Asn Asp Val Leu Glu His Val Lys
2305 2310 2315

cac ttt gtt ata aat ctt att ggg gat ttt gaa gta gct gag aaa atc 7130
His Phe Val Ile Asn Leu Ile Gly Asp Phe Glu Val Ala Glu Lys Ile
2320 2325 2330

aat gcc ttc aga gcc aaa gtc cat gag tta atc gag agg tat gaa gta 7178
Asn Ala Phe Arg Ala Lys Val His Glu Leu Ile Glu Arg Tyr Glu Val
2335 2340 2345 2350

gac caa caa atc cag gtt tta atg gat aaa tta gta gag ttg acc cac
Asp Gln Gln Ile Gln Val Leu Met Asp Lys Leu Val Glu Leu Thr His
2355 2360 2365

caa tac aag ttg aag gag act att cag aag cta agc aat gtc cta caa 7274
Gln Tyr Lys Leu Lys Glu Thr Ile Gln Lys Leu Ser Asn Val Leu Gln
2370 2375 2380

caa gtt aag ata aaa gat tac ttt gag aaa ttg gtt gga ttt att gat
Gln Val Lys Ile Lys Asp Tyr Phe Glu Lys Leu Val Gly Phe Ile Asp
2385 2390 2395

gat gct gtg aag aag ctt aat gaa tta tct ttt aaa aca ttc att gaa
Asp Ala Val Lys Lys Leu Asn Glu Leu Ser Phe Lys Thr Phe Ile Glu
2400 2405 2410

gat gtt aac aaa ttc ctt gac atg ttg ata aag aaa tta aag tca ttt
Asp Val Asn Lys Phe Leu Asp Met Leu Ile Lys Lys Leu Lys Ser Phe
2415 2420 2425 2430

gat tac cac cag ttt gta gat gaa acc aat gac aaa atc cgt gag gtg
Asp Tyr His Gln Phe Val Asp Glu Thr Asn Asp Lys Ile Arg Glu Val
2435 2440 2445

act cag aga ctc aat ggt gaa att cag gct ctg gaa cta cca caa aaa
Thr Gln Arg Leu Asn Gly Glu Ile Gln Ala Leu Glu Leu Pro Gln Lys
2450 2455 2460

gct gaa gca tta aaa ctg ttt tta gag gaa acc aag gcc aca gtt gca
Ala Glu Ala Leu Lys Leu Phe Leu Glu Glu Thr Lys Ala Thr Val Ala
2465 2470 2475

gtg tat ctg gaa agc cta cag gac acc aaa ata acc tta atc atc aat
Val Tyr Leu Glu Ser Leu Gln Asp Thr Lys Ile Thr Leu Ile Ile Asn
2480 2485 2490

tgg tta cag gag gct tta agt tca gca tct ttg gct cac atg aag gcc
Trp Leu Gln Glu Ala Leu Ser Ser Ala Ser Leu Ala His Met Lys Ala
2495 2500 2505 2510

aaa ttc cga gag act cta gaa gat aca cga gac cga atg tat caa atg
Lys Phe Arg Glu Thr Leu Glu Asp Thr Arg Asp Arg Met Tyr Gln Met
2515 2520 2525

gac att cag cag gaa ctt caa cga tac ctg tct ctg gta ggc cag gtt
Asp Ile Gln Gln Glu Leu Gln Arg Tyr Leu Ser Leu Val Gly Gln Val
2530 2535 2540

tat agc aca ctt gtc acc tac att tct gat tgg tgg act ctt gct gct
Tyr Ser Thr Leu Val Thr Tyr Ile Ser Asp Trp Trp Thr Leu Ala Ala
2545 2550 2555

aag aac ctt act gac ttt gca gag caa tat tct atc caa gat tgg gct
Lys Asn Leu Thr Asp Phe Ala Glu Gln Tyr Ser Ile Gln Asp Trp Ala
2560 2565 2570

aaa cgt atg aaa gca ttg gta gag caa ggg ttc act gtt cct gaa atc
Lys Arg Met Lys Ala Leu Val Glu Gln Gly Phe Thr Val Pro Glu Ile
2575 2580 2585 2590

aag acc atc ctt ggg acc atg cct gcc ttt gaa gtc agt ctt cag gct
Lys Thr Ile Leu Gly Thr Met Pro Ala Phe Glu Val Ser Leu Gln Ala
2595 2600 2605

ctt cag aaa gct acc ttc cag aca cct gat ttt ata gtc ccc cta aca
Leu Gln Lys Ala Thr Phe Gln Thr Pro Asp Phe Ile Val Pro Leu Thr
2610 2615 2620

gat ttg agg att cca tca gtt cag ata aac ttc aaa gac tta aaa aat
Asp Leu Arg Ile Pro Ser Val Gln Ile Asn Phe Lys Asp Leu Lys Asn
2625 2630 2635

ata aaa atc cca tcc agg ttt tcc aca cca gaa ttt acc atc ctt aac
Ile Lys Ile Pro Ser Arg Phe Ser Thr Pro Glu Phe Thr Ile Leu Asn
2640 2645 2650

acc ttc cac att cct tcc ttt aca att gac ttt gtc gaa atg aaa gta
Thr Phe His Ile Pro Ser Phe Thr Ile Asp Phe Val Glu Met Lys Val
2655 2660 2665 2670

aag atc atc aga acc att gac cag atg cag aac agt gag ctg cag tgg
Lys Ile Ile Arg Thr Ile Asp Gln Met Gln Asn Ser Glu Leu Gln Trp
2675 2680 2685

ccc gtt cca gat ata tat ctc agg gat ctg aag gtg gag gac att cct
Pro Val Pro Asp Ile Tyr Leu Arg Asp Leu Lys Val Glu Asp Ile Pro
2690 2695 2700

cta gcg aga atc acc ctg cca gac ttc cgt tta cca gaa atc gca att
Leu Ala Arg Ile Thr Leu Pro Asp Phe Arg Leu Pro Glu Ile Ala Ile
2705 2710 2715

cca gaa ttc ata atc cca act ctc aac ctt aat gat ttt caa gtt cct
Pro Glu Phe Ile Ile Pro Thr Leu Asn Leu Asn Asp Phe Gln Val Pro
2720 2725 2730

gac ctt cac ata cca gaa ttc cag ctt ccc cac atc tca cac aca att
Asp Leu His Ile Pro Glu Phe Gln Leu Pro His Ile Ser His Thr Ile
2735 2740 2745 2750

gaa gta cct act ttt ggc aag cta tac agt att ctg aaa atc caa tct
Glu Val Pro Thr Phe Gly Lys Leu Tyr Ser Ile Leu Lys Ile Gln Ser
2755 2760 2765

cct ctt ttc aca tta gat gca aat gct gac ata ggg aat gga acc acc
Pro Leu Phe Thr Leu Asp Ala Asn Ala Asp Ile Gly Asn Gly Thr Thr
2770 2775 2780

tca gca aac gaa gca ggt atc gca gct tcc atc act gcc aaa gga gag
Ser Ala Asn Glu Ala Gly Ile Ala Ala Ser Ile Thr Ala Lys Gly Glu
2785 2790 2795

tcc aaa tta gaa gtt ctc aat ttt gat ttt caa gca aat gca caa ctc
Ser Lys Leu Glu Val Leu Asn Phe Asp Phe Gln Ala Asn Ala Gln Leu
2800 2805 2810

tca aac cct aag att aat ccg ctg gct ctg aag gag tca gtg aag ttc
Ser Asn Pro Lys Ile Asn Pro Leu Ala Leu Lys Glu Ser Val Lys Phe
2815 2820 2825 2830

tcc agc aag tac ctg aga acg gag cat ggg agt gaa atg ctg ttt ttt
Ser Ser Lys Tyr Leu Arg Thr Glu His Gly Ser Glu Met Leu Phe Phe
2835 2840 2845

gga aat gct att gag gga aaa tca aac aca gtg gca agt tta cac aca
Gly Asn Ala Ile Glu Gly Lys Ser Asn Thr Val Ala Ser Leu His Thr
2850 2855 2860

gaa aaa aat aca ctg gag ctt agt aat gga gtg att gtc aag ata aac
Glu Lys Asn Thr Leu Glu Leu Ser Asn Gly Val Ile Val Lys Ile Asn
2865 2870 2875

aat cag ctt acc ctg gat agc aac act aaa tac ttc cac aaa ttg aac 8810
Asn Gln Leu Thr Leu Asp Ser Asn Thr Lys Tyr Phe His Lys Leu Asn
2880 2885 2890

atc ccc aaa ctg gac ttc tct agt cag gct gac ctg cgc aac gag atc 8858
Ile Pro Lys Leu Asp Phe Ser Ser Gln Ala Asp Leu Arg Asn Glu Ile
2895 2900 2905 2910

aag aca ctg ttg aaa gct ggc cac ata gca tgg act tct tct gga aaa 8906
Lys Thr Leu Leu Lys Ala Gly His Ile Ala Trp Thr Ser Ser Gly Lys
2915 2920 2925

ggg tca tgg aaa tgg gcc tgc ccc aga ttc tca gat gag gga aca cat 8954
Gly Ser Trp Lys Trp Ala Cys Pro Arg Phe Ser Asp Glu Gly Thr His
2930 2935 2940

gaa tca caa att agt ttc acc ata gaa gga ccc ctc act tcc ttt gga 9002
Glu Ser Gln Ile Ser Phe Thr Ile Glu Gly Pro Leu Thr Ser Phe Gly
2945 2950 2955

ctg tcc aat aag atc aat agc aaa cac cta aga gta aac caa aac ttg 9050
Leu Ser Asn Lys Ile Asn Ser Lys His Leu Arg Val Asn Gln Asn Leu
2960 2965 2970

gtt tat gaa tct ggc tcc ctc aac ttt tct aaa ctt gaa att caa tca 9098
Val Tyr Glu Ser Gly Ser Leu Asn Phe Ser Lys Leu Glu Ile Gln Ser
2975 2980 2985 2990

caa gtc gat tcc cag cat gtg ggc cac agt gtt cta act gct aaa ggc 9146
Gln Val Asp Ser Gln His Val Gly His Ser Val Leu Thr Ala Lys Gly
2995 3000 3005

atg gca ctg ttt gga gaa ggg aag gca gag ttt act ggg agg cat gat 9194
Met Ala Leu Phe Gly Glu Gly Lys Ala Glu Phe Thr Gly Arg His Asp
3010 3015 3020

gct cat tta aat gga aag gtt att gga act ttg aaa aat tct ctt ttc 9242
Ala His Leu Asn Gly Lys Val Ile Gly Thr Leu Lys Asn Ser Leu Phe
3025 3030 3035

ttt tca gcc cag cca ttt gag atc acg gca tcc aca aac aat gaa ggg 9290
Phe Ser Ala Gln Pro Phe Glu Ile Thr Ala Ser Thr Asn Asn Glu Gly
3040 3045 3050

aat ttg aaa gtt cgt ttt cca tta agg tta aca ggg aag ata gac ttc 9338
Asn Leu Lys Val Arg Phe Pro Leu Arg Leu Thr Gly Lys Ile Asp Phe
3055 3060 3065 3070

ctg aat aac tat gca ctg ttt ctg agt ccc agt gcc cag caa gca agt 9386
Leu Asn Asn Tyr Ala Leu Phe Leu Ser Pro Ser Ala Gln Gln Ala Ser
3075 3080 3085

tgg caa gta agt gct agg ttc aat cag tat aag tac aac caa aat ttc 9434
Trp Gln Val Ser Ala Arg Phe Asn Gln Tyr Lys Tyr Asn Gln Asn Phe
3090 3095 3100

tct gct gga aac aac gag aac att atg gag gcc cat gta gga ata aat 9482
Ser Ala Gly Asn Asn Glu Asn Ile Met Glu Ala His Val Gly Ile Asn
3105 3110 3115

gga gaa gca aat ctg gat ttc tta aac att cct tta aca att cct gaa 9530
Gly Glu Ala Asn Leu Asp Phe Leu Asn Ile Pro Leu Thr Ile Pro Glu
3120 3125 3130

atg cgt cta cct tac aca ata atc aca act cct cca ctg aaa gat ttc 9578
Met Arg Leu Pro Tyr Thr Ile Ile Thr Thr Pro Pro Leu Lys Asp Phe
3135 3140 3145 3150

tct cta tgg gaa aaa aca ggc ttg aag gaa ttc ttg aaa acg aca aag 9626
Ser Leu Trp Glu Lys Thr Gly Leu Lys Glu Phe Leu Lys Thr Thr Lys
3155 3160 3165

caa tca ttt gat tta agt gta aaa gct cag tat aag aaa aac aaa cac 9674
Gln Ser Phe Asp Leu Ser Val Lys Ala Gln Tyr Lys Lys Asn Lys His
3170 3175 3180

agg cat tcc atc aca aat cct ttg gct gtg ctt tgt gag ttt atc agt 9722
Arg His Ser Ile Thr Asn Pro Leu Ala Val Leu Cys Glu Phe Ile Ser
3185 3190 3195

cag agc atc aaa tcc ttt gac agg cat ttt gaa aaa aac aga aac aat 9770
Gln Ser Ile Lys Ser Phe Asp Arg His Phe Glu Lys Asn Arg Asn Asn
3200 3205 3210

gca tta gat ttt gtc acc aaa tcc tat aat gaa aca aaa att aag ttt 9818
Ala Leu Asp Phe Val Thr Lys Ser Tyr Asn Glu Thr Lys Ile Lys Phe
3215 3220 3225 3230

gat aag tac aaa gct gaa aaa tct cac gac gag ctc ccc agg acc ttt 9866
Asp Lys Tyr Lys Ala Glu Lys Ser His Asp Glu Leu Pro Arg Thr Phe
3235 3240 3245

caa att cct gga tac act gtt cca gtt gtc aat gtt gaa gtg tct cca 9914
Gln Ile Pro Gly Tyr Thr Val Pro Val Val Asn Val Glu Val Ser Pro
3250 3255 3260

ttc acc ata gag atg tcg gca ttc ggc tat gtg ttc cca aaa gca gtc 9962
Phe Thr Ile Glu Met Ser Ala Phe Gly Tyr Val Phe Pro Lys Ala Val
3265 3270 3275

aac atg cct agt ttc tcc atc cta ggt tct gac gtc cgt gtg cct tca 10010
Ser Met Pro Ser Phe Ser Ile Leu Gly Ser Asp Val Arg Val Pro Ser
3280 3285 3290

tac aca tta atc ctg cca tca tta gag ctg cca gtc ctt cat gtc cct 10058
Tyr Thr Leu Ile Leu Pro Ser Leu Glu Leu Pro Val Leu His Val Pro
3295 3300 3305 3310

aga aat ctc aag ctt tct ctt cca cat ttc aag gaa ttg tgt acc ata 10106
Arg Asn Leu Lys Leu Ser Leu Pro His Phe Lys Glu Leu Cys Thr Ile
3315 3320 3325

agc cat att ttt att cct gcc atg ggc aat att acc tat gat ttc tcc 10154
Ser His Ile Phe Ile Pro Ala Met Gly Asn Ile Thr Tyr Asp Phe Ser
3330 3335 3340


ttt aaa tca agt gtc atc aca ctg aat acc aat gct gaa ctt ttt aac 10202
Phe Lys Ser Ser Val Ile Thr Leu Asn Thr Asn Ala Glu Leu Phe Asn
3345 3350 3355

cag tca gat att gtt gct cat ctc ctt tct tca tct tca tct gtc att 10250
Gln Ser Asp Ile Val Ala His Leu Leu Ser Ser Ser Ser Ser Val Ile
3360 3365 3370

gat gca ctg cag tac aaa tta gag ggc acc aca aga ttg aca aga aaa 10298
Asp Ala Leu Gln Tyr Lys Leu Glu Gly Thr Thr Arg Leu Thr Arg Lys
3375 3380 3385 3390

agg gga ttg aag tta gcc aca gct ctg tct ctg agc aac aaa ttt gtg 10346
Arg Gly Leu Lys Leu Ala Thr Ala Leu Ser Leu Ser Asn Lys Phe Val
3395 3400 3405

gag ggt agt cat aac agt act gtg agc tta acc acg aaa aat atg gaa 10394
Glu Gly Ser His Asn Ser Thr Val Ser Leu Thr Thr Lys Asn Met Glu
3410 3415 3420

gtg tca gtg gca aaa acc aca aaa gcc gaa att cca att ttg aga atg 10442
Val Ser Val Ala Lys Thr Thr Lys Ala Glu Ile Pro Ile Leu Arg Met
3425 3430 3435

aat ttc aag caa gaa ctt aat gga aat acc aag tca aaa cct act gtc 10490
Asn Phe Lys Gln Glu Leu Asn Gly Asn Thr Lys Ser Lys Pro Thr Val
3440 3445 3450

tct tcc tcc atg gaa ttt aag tat gat ttc aat tct tca atg ctg tac 10538
Ser Ser Ser Met Glu Phe Lys Tyr Asp Phe Asn Ser Ser Met Leu Tyr
3455 3460 3465 3470

tct acc gct aaa gga gca gtt gac cac aag ctt agc ttg gaa agc ctc 10586
Ser Thr Ala Lys Gly Ala Val Asp His Lys Leu Ser Leu Glu Ser Leu
3475 3480 3485

acc tct tac ttt tcc att gag tca tct acc aaa gga gat gtc aag ggt 10634
Thr Ser Tyr Phe Ser Ile Glu Ser Ser Thr Lys Gly Asp Val Lys Gly
3490 3495 3500

tcg gtt ctt tct cgg gaa tat tca gga act att gct agt gag gcc aac 10682
Ser Val Leu Ser Arg Glu Tyr Ser Gly Thr Ile Ala Ser Glu Ala Asn
3505 3510 3515

act tac ttg aat tcc aag agc aca cgg tct tca gtg aag ctg cag ggc 10730
Thr Tyr Leu Asn Ser Lys Ser Thr Arg Ser Ser Val Lys Leu Gln Gly
3520 3525 3530

act tcc aaa att gat gat atc tgg aac ctt gaa gta aaa gaa aat ttt 10778
Thr Ser Lys Ile Asp Asp Ile Trp Asn Leu Glu Val Lys Glu Asn Phe
3535 3540 3545 3550

gct gga gaa gcc aca ctc caa cgc ata tat tcc ctc tgg gag cac agt 10826
Ala Gly Glu Ala Thr Leu Gln Arg Ile Tyr Ser Leu Trp Glu His Ser
3555 3560 3565

acg aaa aac cac tta cag cta gag ggc ctc ttt ttc acc aac gga gaa 10874
Thr Lys Asn His Leu Gln Leu Glu Gly Leu Phe Phe Thr Asn Gly Glu
3570 3575 3580

cat aca agc aaa gcc acc ctg gaa ctc tct cca tgg caa atg tca gct 10922
His Thr Ser Lys Ala Thr Leu Glu Leu Ser Pro Trp Gln Met Ser Ala
3585 3590 3595

ctt gtt cag gtc cat gca agt cag ccc agt tcc ttc cat gat ttc cct 10970
Leu Val Gln Val His Ala Ser Gln Pro Ser Ser Phe His Asp Phe Pro
3600 3605 3610

gac ctt ggc cag gaa gtg gcc ctg aat gct aac act aag aac cag aag 11018
Asp Leu Gly Gln Glu Val Ala Leu Asn Ala Asn Thr Lys Asn Gln Lys
3615 3620 3625 3630

atc aga tgg aaa aat gaa gtc cgg att cat tct ggg tct ttc cag agc 11066
Ile Arg Trp Lys Asn Glu Val Arg Ile His Ser Gly Ser Phe Gln Ser
3635 3640 3645

cag gtc gag ctt tcc aat gac caa gaa aag gca cac ctt gac att gca 11114
Gln Val Glu Leu Ser Asn Asp Gln Glu Lys Ala His Leu Asp Ile Ala
3650 3655 3660

gga tcc tta gaa gga cac cta agg ttc ctc aaa aat atc atc cta cca 11162
Gly Ser Leu Glu Gly His Leu Arg Phe Leu Lys Asn Ile Ile Leu Pro
3665 3670 3675

gtc tat gac aag agc tta tgg gat ttc cta aag ctg gat gta acc acc 11210
Val Tyr Asp Lys Ser Leu Trp Asp Phe Leu Lys Leu Asp Val Thr Thr
3680 3685 3690

aac att gat agg aga cag cat ctt cgt gtt tca act gcc ttt gtg tac 11258
Ser Ile Gly Arg Arg Gln His Leu Arg Val Ser Thr Ala Phe Val Tyr
3695 3700 3705 3710

acc aaa aac ccc aat ggc tat tca ttc tcc atc cct gta aaa gtt ttg 11306
Thr Lys Asn Pro Asn Gly Tyr Ser Phe Ser Ile Pro Val Lys Val Leu
3715 3720 3725

gct gat aaa ttc att act cct ggg ctg aaa cta aat gat cta aat tca 11354
Ala Asp Lys Phe Ile Thr Pro Gly Leu Lys Leu Asn Asp Leu Asn Ser
3730 3735 3740

gtt ctt gtc atg cct acg ttc cat gtc cca ttt aca gat ctt cag gtt 11402
Val Leu Val Met Pro Thr Phe His Val Pro Phe Thr Asp Leu Gln Val
3745 3750 3755

cca tcg tgc aaa ctt gac ttc aga gaa ata caa atc tat aag aag ctg 11450
Pro Ser Cys Lys Leu Asp Phe Arg Glu Ile Gln Ile Tyr Lys Lys Leu
3760 3765 3770

aga act tca tca ttt gcc ctc aac cta cca aca ctc ccc gag gta aaa 11498
Arg Thr Ser Ser Phe Ala Leu Asn Leu Pro Thr Leu Pro Glu Val Lys
3775 3780 3785 3790

ttc cct gaa gtt gat gtg tta aca aaa tat tct caa cca gaa gac tcc
Phe Pro Glu Val Asp Val Leu Thr Lys Tyr Ser Gln Pro Glu Asp Ser
3795 3800 3805


ttg att ccc ttt ttt gag ata acc gtg cct gaa tct cag tta act gtg 
Leu lie Pro Phe Phe Glu He Thr Val Pro Glu Ser Gin Leu Thr Val 
3810 3815 3820 


11594 


tec cag ttc acg ctt cca aaa agt gtt tea gat ggc att get get ttg 
Ser Gin Phe Thr Leu Pro Lys Ser Val Ser Asp Gly He Ala Ala Leu 
3825 3830 3835 

gat eta aat gca gta gee aac aag ate gca gac ttt gag ttg ccc acc 
Asp Leu Asn Ala Val Ala Asn Lys He Ala Asp Phe Glu Leu Pro Thr 
3840 3845 3850 

ate ate gtg cct gag cag acc att gag att ccc tec att aag ttc tct 
He He Val Pro Glu Gin Thr He Glu He Pro Ser He Lys Phe Ser 
3855 3860 3865 3870 

gta cct get gga att gtc att cct tec ttt caa gca ctg act gca cgc 
Val Pro Ala Gly He Val He Pro Ser Phe Gin Ala Leu Thr Ala Arg 
3875 3880 3885 

ttt gag gta gac tct ccc gtg tat aat gec act tgg agt gee agt ttg 
Phe Glu Val Asp Ser Pro Val Tyr Asn Ala Thr Trp Ser Ala Ser Leu 
3890 3895 3900 

aaa aac aaa gca gat tat gtt gaa aca gtc ctg gat tec aca tgc age 
Lys Asn Lys Ala Asp Tyr Val Glu Thr Val Leu Asp Ser Thr Cys Ser 
3905 3910 3915 

tea acc gta cag ttc eta gaa tat gaa eta aat gtt ttg gga aca cac 
Ser Thr Val Gin Phe Leu Glu Tyr Glu Leu Asn Val Leu Gly Thr His 
3920 3925 3930 

aaa ate gaa gat ggt acg tta gee tct aag act aaa gga aca ctt gca 
Lys He Glu Asp Gly Thr Leu Ala Ser Lys Thr Lys Gly Thr Leu Ala 
3935 3940 3945 3950 

cac cgt gac ttc agt gca gaa tat gaa gaa gat ggc aaa ttt gaa gga 
His Arg Asp Phe Ser Ala Glu Tyr Glu Glu Asp Gly Lys Phe Glu Gly 
3955 3960 3965 

ctt cag gaa tgg gaa gga aaa gcg cac etc aat ate aaa age cca gcg 
Leu Gin Glu Trp Glu Gly Lys Ala His Leu Asn He Lys Ser Pro Ala 
3970 3975 3980 

ttc acc gat etc cat ctg cgc tac cag aaa gac aag aaa ggc ate tec 
Phe Thr Asp Leu His Leu Arg Tyr Gin Lys Asp Lys Lys Gly He Ser 
3985 3990 3995 

acc tea gca gee tec cca gee gta ggc acc gtg ggc atg gat atg gat 
Thr Ser Ala Ala Ser Pro Ala Val Gly Thr Val Gly Met Asp Met Asp 
4000 4005 4010 

gaa gat gac gac ttt tct aaa tgg aac ttc tac tac age cct cag tec 
Glu Asp Asp Asp Phe Ser Lys Trp Asn Phe Tyr Tyr Ser Pro Gin Ser 
4015 4020 4025 4030 


11642 


11690 


11738 


11786 


11834 


11882 


11930 


11978 


12026 


12074 


12122

tct cca gat aaa aaa ctc acc ata ttc aaa act gag ttg agg gtc cgg
Ser Pro Asp Lys Lys Leu Thr Ile Phe Lys Thr Glu Leu Arg Val Arg
4035 4040 4045

gaa tct gat gag gaa act cag atc aaa gtt aat tgg gaa gaa gag gca
Glu Ser Asp Glu Glu Thr Gln Ile Lys Val Asn Trp Glu Glu Glu Ala
4050 4055 4060

gct tct ggc ttg cta acc tct ctg aaa gac aac gtg ccc aag gcc aca
Ala Ser Gly Leu Leu Thr Ser Leu Lys Asp Asn Val Pro Lys Ala Thr
4065 4070 4075

ggg gtc ctt tat gat tat gtc aac aag tac cac tgg gaa cac aca ggg
Gly Val Leu Tyr Asp Tyr Val Asn Lys Tyr His Trp Glu His Thr Gly
4080 4085 4090

ctc acc ctg aga gaa gtg tct tca aag ctg aga aga aat ctg cag aac
Leu Thr Leu Arg Glu Val Ser Ser Lys Leu Arg Arg Asn Leu Gln Asn
4095 4100 4105 4110

aat gct gag tgg gtt tat caa ggg gcc att agg caa att gat gat atc
Asn Ala Glu Trp Val Tyr Gln Gly Ala Ile Arg Gln Ile Asp Asp Ile
4115 4120 4125

gac gtg agg ttc cag aaa gca gcc agt ggc acc act ggg acc tac caa
Asp Val Arg Phe Gln Lys Ala Ala Ser Gly Thr Thr Gly Thr Tyr Gln
4130 4135 4140

gag tgg aag gac aag gcc cag aat ctg tac cag gaa ctg ttg act cag
Glu Trp Lys Asp Lys Ala Gln Asn Leu Tyr Gln Glu Leu Leu Thr Gln
4145 4150 4155

gaa ggc caa gcc agt ttc cag gga ctc aag gat aac gtg ttt gat ggc
Glu Gly Gln Ala Ser Phe Gln Gly Leu Lys Asp Asn Val Phe Asp Gly
4160 4165 4170

ttg gta cga gtt act caa aaa ttc cat atg aaa gtc aag cat ctg att
Leu Val Arg Val Thr Gln Lys Phe His Met Lys Val Lys His Leu Ile
4175 4180 4185 4190

gac tca ctc att gat ttt ctg aac ttc ccc aga ttc cag ttt ccg ggg
Asp Ser Leu Ile Asp Phe Leu Asn Phe Pro Arg Phe Gln Phe Pro Gly
4195 4200 4205

aaa cct ggg ata tac act agg gag gaa ctt tgc act atg ttc ata agg
Lys Pro Gly Ile Tyr Thr Arg Glu Glu Leu Cys Thr Met Phe Ile Arg
4210 4215 4220

gag gta ggg acg gta ctg tcc cag gta tat tcg aaa gtc cat aat ggt
Glu Val Gly Thr Val Leu Ser Gln Val Tyr Ser Lys Val His Asn Gly
4225 4230 4235

tca gaa ata ctg ttt tcc tat ttc caa gac cta gtg att aca ctt cct
Ser Glu Ile Leu Phe Ser Tyr Phe Gln Asp Leu Val Ile Thr Leu Pro
4240 4245 4250

ttc gag tta agg aaa cat aaa cta ata gat gta atc tcg atg tat agg
Phe Glu Leu Arg Lys His Lys Leu Ile Asp Val Ile Ser Met Tyr Arg
4255 4260 4265 4270

gaa ctg ttg aaa gat tta tca aaa gaa gcc caa gag gta ttt aaa gcc 12986
Glu Leu Leu Lys Asp Leu Ser Lys Glu Ala Gln Glu Val Phe Lys Ala
4275 4280 4285


att cag tct ctc aag acc aca gag gtg cta cgt aat ctt cag gac ctt 13034
Ile Gln Ser Leu Lys Thr Thr Glu Val Leu Arg Asn Leu Gln Asp Leu
4290 4295 4300

tta caa ttc att ttc caa cta ata gaa gat aac att aaa cag ctg aaa 13082
Leu Gln Phe Ile Phe Gln Leu Ile Glu Asp Asn Ile Lys Gln Leu Lys
4305 4310 4315

gag atg aaa ttt act tat ctt att aat tat atc caa gat gag atc aac 13130
Glu Met Lys Phe Thr Tyr Leu Ile Asn Tyr Ile Gln Asp Glu Ile Asn
4320 4325 4330

aca atc ttc aat gat tat atc cca tat gtt ttt aaa ttg ttg aaa gaa 13178
Thr Ile Phe Asn Asp Tyr Ile Pro Tyr Val Phe Lys Leu Leu Lys Glu
4335 4340 4345 4350

aac cta tgc ctt aat ctt cat aag ttc aat gaa ttt att caa aac gag 13226
Asn Leu Cys Leu Asn Leu His Lys Phe Asn Glu Phe Ile Gln Asn Glu
4355 4360 4365

ctt cag gaa gct tct caa gag tta cag cag atc cat caa tac att atg 13274
Leu Gln Glu Ala Ser Gln Glu Leu Gln Gln Ile His Gln Tyr Ile Met
4370 4375 4380

gcc ctt cgt gaa gaa tat ttt gat cca agt ata gtt ggc tgg aca gtg 13322
Ala Leu Arg Glu Glu Tyr Phe Asp Pro Ser Ile Val Gly Trp Thr Val
4385 4390 4395

aaa tat tat gaa ctt gaa gaa aag ata gtc agt ctg atc aag aac ctg 13370
Lys Tyr Tyr Glu Leu Glu Glu Lys Ile Val Ser Leu Ile Lys Asn Leu
4400 4405 4410

tta gtt gct ctt aag gac ttc cat tct gaa tat att gtc agt gcc tct 13418
Leu Val Ala Leu Lys Asp Phe His Ser Glu Tyr Ile Val Ser Ala Ser
4415 4420 4425 4430

aac ttt act tcc caa ctc tca agt caa gtt gag caa ttt ctg cac aga 13466
Asn Phe Thr Ser Gln Leu Ser Ser Gln Val Glu Gln Phe Leu His Arg
4435 4440 4445

aat att cag gaa tat ctt agc atc ctt acc gat cca gat gga aaa ggg 13514
Asn Ile Gln Glu Tyr Leu Ser Ile Leu Thr Asp Pro Asp Gly Lys Gly
4450 4455 4460

aaa gag aag att gca gag ctt tct gcc act gct cag gaa ata att aaa 13562
Lys Glu Lys Ile Ala Glu Leu Ser Ala Thr Ala Gln Glu Ile Ile Lys
4465 4470 4475

agc cag gcc att gcg acg aag aaa ata att tct gat tac cac cag cag 13610
Ser Gln Ala Ile Ala Thr Lys Lys Ile Ile Ser Asp Tyr His Gln Gln
4480 4485 4490

ttt aga tat aaa ctg caa gat ttt tca gac caa ctc tct gat tac tat
Phe Arg Tyr Lys Leu Gln Asp Phe Ser Asp Gln Leu Ser Asp Tyr Tyr
4495 4500 4505 4510

gaa aaa ttt att gct gaa tcc aaa aga ttg att gac ctg tcc att caa
Glu Lys Phe Ile Ala Glu Ser Lys Arg Leu Ile Asp Leu Ser Ile Gln
4515 4520 4525

aac tac cac aca ttt ctg ata tac atc acg gag tta ctg aaa aag ctg
Asn Tyr His Thr Phe Leu Ile Tyr Ile Thr Glu Leu Leu Lys Lys Leu
4530 4535 4540

caa tca acc aca gtc atg aac ccc tac atg aag ctt gct cca gga gaa
Gln Ser Thr Thr Val Met Asn Pro Tyr Met Lys Leu Ala Pro Gly Glu
4545 4550 4555

ctt act atc atc ctc taa ttttttaaaa gaaatcttca tttattcttc
Leu Thr Ile Ile Leu *
4560

ttttccaatt gaactttcac atagcacaga aaaaattcaa actgcctata ttgataaaac
catacagtga gccagccttg cagtaggcag tagactataa gcagaagcac atatgaactg
gacctgcacc aaagctggca ccagggctcg gaaggtctct gaactcagaa ggatggcatt
ttttgcaagt taaagaaaat caggatctga gttattttgc taaacttggg ggaggaggaa
caaataaatg gagtctttat tgtgtatcat a

<210> 32 
<211> 4563 
<212> PRT 
<213> Homo sapiens

<400> 32

Met Asp Pro Pro Arg Pro Ala Leu Leu Ala Leu Leu Ala Leu Pro Ala
1 5 10 15

Leu Leu Leu Leu Leu Leu Ala Gly Ala Arg Ala Glu Glu Glu Met Leu
20 25 30

Glu Asn Val Ser Leu Val Cys Pro Lys Asp Ala Thr Arg Phe Lys His
35 40 45

Leu Arg Lys Tyr Thr Tyr Asn Tyr Glu Ala Glu Ser Ser Ser Gly Val
50 55 60

Pro Gly Thr Ala Asp Ser Arg Ser Ala Thr Arg Ile Asn Cys Lys Val
65 70 75 80

Glu Leu Glu Val Pro Gln Leu Cys Ser Phe Ile Leu Lys Thr Ser Gln
85 90 95

Cys Thr Leu Lys Glu Val Tyr Gly Phe Asn Pro Glu Gly Lys Ala Leu
100 105 110

Leu Lys Lys Thr Lys Asn Ser Glu Glu Phe Ala Ala Ala Met Ser Arg
115 120 125

Tyr Glu Leu Lys Leu Ala Ile Pro Glu Gly Lys Gln Val Phe Leu Tyr
130 135 140

Pro Glu Lys Asp Glu Pro Thr Tyr Ile Leu Asn Ile Lys Arg Gly Ile
145 150 155 160

Ile Ser Ala Leu Leu Val Pro Pro Glu Thr Glu Glu Ala Lys Gln Val
165 170 175

Leu Phe Leu Asp Thr Val Tyr Gly Asn Cys Ser Thr His Phe Thr Val
180 185 190

Lys Thr Arg Lys Gly Asn Val Ala Thr Glu Ile Ser Thr Glu Arg Asp
195 200 205

Leu Gly Gln Cys Asp Arg Phe Lys Pro Ile Arg Thr Gly Ile Ser Pro
210 215 220

Leu Ala Leu Ile Lys Gly Met Thr Arg Pro Leu Ser Thr Leu Ile Ser
225 230 235 240

Ser Ser Gln Ser Cys Gln Tyr Thr Leu Asp Ala Lys Arg Lys His Val
245 250 255

Ala Glu Ala Ile Cys Lys Glu Gln His Leu Phe Leu Pro Phe Ser Tyr
260 265 270

Asn Asn Lys Tyr Gly Met Val Ala Gln Val Thr Gln Thr Leu Lys Leu
275 280 285

Glu Asp Thr Pro Lys Ile Asn Ser Arg Phe Phe Gly Glu Gly Thr Lys
290 295 300

Lys Met Gly Leu Ala Phe Glu Ser Thr Lys Ser Thr Ser Pro Pro Lys
305 310 315 320

Gln Ala Glu Ala Val Leu Lys Thr Leu Gln Glu Leu Lys Lys Leu Thr
325 330 335

Ile Ser Glu Gln Asn Ile Gln Arg Ala Asn Leu Phe Asn Lys Leu Val
340 345 350

Thr Glu Leu Arg Gly Leu Ser Asp Glu Ala Val Thr Ser Leu Leu Pro
355 360 365

Gln Leu Ile Glu Val Ser Ser Pro Ile Thr Leu Gln Ala Leu Val Gln
370 375 380

Cys Gly Gln Pro Gln Cys Ser Thr His Ile Leu Gln Trp Leu Lys Arg
385 390 395 400

Val His Ala Asn Pro Leu Leu Ile Asp Val Val Thr Tyr Leu Val Ala
405 410 415

Leu Ile Pro Glu Pro Ser Ala Gln Gln Leu Arg Glu Ile Phe Asn Met
420 425 430

Ala Arg Asp Gln Arg Ser Arg Ala Thr Leu Tyr Ala Leu Ser His Ala
435 440 445

Val Asn Asn Tyr His Lys Thr Asn Pro Thr Gly Thr Gln Glu Leu Leu
450 455 460

Asp Ile Ala Asn Tyr Leu Met Glu Gln Ile Gln Asp Asp Cys Thr Gly
465 470 475 480

Asp Glu Asp Tyr Thr Tyr Leu Ile Leu Arg Val Ile Gly Asn Met Gly
485 490 495

Gln Thr Met Glu Gln Leu Thr Pro Glu Leu Lys Ser Ser Ile Leu Lys
500 505 510

Cys Val Gln Ser Thr Lys Pro Ser Leu Met Ile Gln Lys Ala Ala Ile
515 520 525

Gln Ala Leu Arg Lys Met Glu Pro Lys Asp Lys Asp Gln Glu Val Leu
530 535 540

Leu Gln Thr Phe Leu Asp Asp Ala Ser Pro Gly Asp Lys Arg Leu Ala
545 550 555 560

Ala Tyr Leu Met Leu Met Arg Ser Pro Ser Gln Ala Asp Ile Asn Lys
565 570 575

Ile Val Gln Ile Leu Pro Trp Glu Gln Asn Glu Gln Val Lys Asn Phe
580 585 590

Val Ala Ser His Ile Ala Asn Ile Leu Asn Ser Glu Glu Leu Asp Ile
595 600 605

Gln Asp Leu Lys Lys Leu Val Lys Glu Ala Leu Lys Glu Ser Gln Leu
610 615 620

Pro Thr Val Met Asp Phe Arg Lys Phe Ser Arg Asn Tyr Gln Leu Tyr
625 630 635 640

Lys Ser Val Ser Leu Pro Ser Leu Asp Pro Ala Ser Ala Lys Ile Glu
645 650 655

Gly Asn Leu Ile Phe Asp Pro Asn Asn Tyr Leu Pro Lys Glu Ser Met
660 665 670

Leu Lys Thr Thr Leu Thr Ala Phe Gly Phe Ala Ser Ala Asp Leu Ile
675 680 685

Glu Ile Gly Leu Glu Gly Lys Gly Phe Glu Pro Thr Leu Glu Ala Leu
690 695 700

Phe Gly Lys Gln Gly Phe Phe Pro Asp Ser Val Asn Lys Ala Leu Tyr
705 710 715 720

Trp Val Asn Gly Gln Val Pro Asp Gly Val Ser Lys Val Leu Val Asp
725 730 735

His Phe Gly Tyr Thr Lys Asp Asp Lys His Glu Gln Asp Met Val Asn
740 745 750

Gly Ile Met Leu Ser Val Glu Lys Leu Ile Lys Asp Leu Lys Ser Lys
755 760 765

Glu Val Pro Glu Ala Arg Ala Tyr Leu Arg Ile Leu Gly Glu Glu Leu
770 775 780

Gly Phe Ala Ser Leu His Asp Leu Gln Leu Leu Gly Lys Leu Leu Leu
785 790 795 800

Met Gly Ala Arg Thr Leu Gln Gly Ile Pro Gln Met Ile Gly Glu Val
805 810 815

Ile Arg Lys Gly Ser Lys Asn Asp Phe Phe Leu His Tyr Ile Phe Met
820 825 830

Glu Asn Ala Phe Glu Leu Pro Thr Gly Ala Gly Leu Gln Leu Gln Ile
835 840 845

Ser Ser Ser Gly Val Ile Ala Pro Gly Ala Lys Ala Gly Val Lys Leu
850 855 860

Glu Val Ala Asn Met Gln Ala Glu Leu Val Ala Lys Pro Ser Val Ser
865 870 875 880

Val Glu Phe Val Thr Asn Met Gly Ile Ile Ile Pro Asp Phe Ala Arg
885 890 895

Ser Gly Val Gln Met Asn Thr Asn Phe Phe His Glu Ser Gly Leu Glu
900 905 910

Ala His Val Ala Leu Lys Ala Gly Lys Leu Lys Phe Ile Ile Pro Ser
915 920 925

Pro Lys Arg Pro Val Lys Leu Leu Ser Gly Gly Asn Thr Leu His Leu
930 935 940

Val Ser Thr Thr Lys Thr Glu Val Ile Pro Pro Leu Ile Glu Asn Arg
945 950 955 960

Gln Ser Trp Ser Val Cys Lys Gln Val Phe Pro Gly Leu Asn Tyr Cys
965 970 975

Thr Ser Gly Ala Tyr Ser Asn Ala Ser Ser Thr Asp Ser Ala Ser Tyr
980 985 990

Tyr Pro Leu Thr Gly Asp Thr Arg Leu Glu Leu Glu Leu Arg Pro Thr
995 1000 1005

Gly Glu Ile Glu Gln Tyr Ser Val Ser Ala Thr Tyr Glu Leu Gln Arg
1010 1015 1020

Glu Asp Arg Ala Leu Val Asp Thr Leu Lys Phe Val Thr Gln Ala Glu
1025 1030 1035 1040

Gly Ala Lys Gln Thr Glu Ala Thr Met Thr Phe Lys Tyr Asn Arg Gln
1045 1050 1055

Ser Met Thr Leu Ser Ser Glu Val Gln Ile Pro Asp Phe Asp Val Asp
1060 1065 1070

Leu Gly Thr Ile Leu Arg Val Asn Asp Glu Ser Thr Glu Gly Lys Thr
1075 1080 1085

Ser Tyr Arg Leu Thr Leu Asp Ile Gln Asn Lys Lys Ile Thr Glu Val
1090 1095 1100

Ala Leu Met Gly His Leu Ser Cys Asp Thr Lys Glu Glu Arg Lys Ile
1105 1110 1115 1120

Lys Gly Val Ile Ser Ile Pro Arg Leu Gln Ala Glu Ala Arg Ser Glu
1125 1130 1135

Ile Leu Ala His Trp Ser Pro Ala Lys Leu Leu Leu Gln Met Asp Ser
1140 1145 1150

Ser Ala Thr Ala Tyr Gly Ser Thr Val Ser Lys Arg Val Ala Trp His
1155 1160 1165

Tyr Asp Glu Glu Lys Ile Glu Phe Glu Trp Asn Thr Gly Thr Asn Val
1170 1175 1180

Asp Thr Lys Lys Met Thr Ser Asn Phe Pro Val Asp Leu Ser Asp Tyr
1185 1190 1195 1200

Pro Lys Ser Leu His Met Tyr Ala Asn Arg Leu Leu Asp His Arg Val
1205 1210 1215

Pro Glu Thr Asp Met Thr Phe Arg His Val Gly Ser Lys Leu Ile Val
1220 1225 1230

Ala Met Ser Ser Trp Leu Gln Lys Ala Ser Gly Ser Leu Pro Tyr Thr
1235 1240 1245

Gln Thr Leu Gln Asp His Leu Asn Ser Leu Lys Glu Phe Asn Leu Gln
1250 1255 1260

Asn Met Gly Leu Pro Asp Phe His Ile Pro Glu Asn Leu Phe Leu Lys
1265 1270 1275 1280

Ser Asp Gly Arg Val Lys Tyr Thr Leu Asn Lys Asn Ser Leu Lys Ile
1285 1290 1295

Glu Ile Pro Leu Pro Phe Gly Gly Lys Ser Ser Arg Asp Leu Lys Met
1300 1305 1310

Leu Glu Thr Val Arg Thr Pro Ala Leu His Phe Lys Ser Val Gly Phe
1315 1320 1325

His Leu Pro Ser Arg Glu Phe Gln Val Pro Thr Phe Thr Ile Pro Lys
1330 1335 1340

Leu Tyr Gln Leu Gln Val Pro Leu Leu Gly Val Leu Asp Leu Ser Thr
1345 1350 1355 1360

Asn Val Tyr Ser Asn Leu Tyr Asn Trp Ser Ala Ser Tyr Ser Gly Gly
1365 1370 1375

Asn Thr Ser Thr Asp His Phe Ser Leu Arg Ala Arg Tyr His Met Lys
1380 1385 1390

Ala Asp Ser Val Val Asp Leu Leu Ser Tyr Asn Val Gln Gly Ser Gly
1395 1400 1405

Glu Thr Thr Tyr Asp His Lys Asn Thr Phe Thr Leu Ser Cys Asp Gly
1410 1415 1420

Ser Leu Arg His Lys Phe Leu Asp Ser Asn Ile Lys Phe Ser His Val
1425 1430 1435 1440

Glu Lys Leu Gly Asn Asn Pro Val Ser Lys Gly Leu Leu Ile Phe Asp
1445 1450 1455

Ala Ser Ser Ser Trp Gly Pro Gln Met Ser Ala Ser Val His Leu Asp
1460 1465 1470

Ser Lys Lys Lys Gln His Leu Phe Val Lys Glu Val Lys Ile Asp Gly
1475 1480 1485

Gln Phe Arg Val Ser Ser Phe Tyr Ala Lys Gly Thr Tyr Gly Leu Ser
1490 1495 1500

Cys Gln Arg Asp Pro Asn Thr Gly Arg Leu Asn Gly Glu Ser Asn Leu
1505 1510 1515 1520

Arg Phe Asn Ser Ser Tyr Leu Gln Gly Thr Asn Gln Ile Thr Gly Arg
1525 1530 1535

Tyr Glu Asp Gly Thr Leu Ser Leu Thr Ser Thr Ser Asp Leu Gln Ser
1540 1545 1550

Gly Ile Ile Lys Asn Thr Ala Ser Leu Lys Tyr Glu Asn Tyr Glu Leu
1555 1560 1565

Thr Leu Lys Ser Asp Thr Asn Gly Lys Tyr Lys Asn Phe Ala Thr Ser
1570 1575 1580

Asn Lys Met Asp Met Thr Phe Ser Lys Gln Asn Ala Leu Leu Arg Ser
1585 1590 1595 1600

Glu Tyr Gln Ala Asp Tyr Glu Ser Leu Arg Phe Phe Ser Leu Leu Ser
1605 1610 1615

Gly Ser Leu Asn Ser His Gly Leu Glu Leu Asn Ala Asp Ile Leu Gly
1620 1625 1630

Thr Asp Lys Ile Asn Ser Gly Ala His Lys Ala Thr Leu Arg Ile Gly
1635 1640 1645

Gln Asp Gly Ile Ser Thr Ser Ala Thr Thr Asn Leu Lys Cys Ser Leu
1650 1655 1660

Leu Val Leu Glu Asn Glu Leu Asn Ala Glu Leu Gly Leu Ser Gly Ala
1665 1670 1675 1680

Ser Met Lys Leu Thr Thr Asn Gly Arg Phe Arg Glu His Asn Ala Lys
1685 1690 1695

Phe Ser Leu Asp Gly Lys Ala Ala Leu Thr Glu Leu Ser Leu Gly Ser
1700 1705 1710

Ala Tyr Gln Ala Met Ile Leu Gly Val Asp Ser Lys Asn Ile Phe Asn
1715 1720 1725

Phe Lys Val Ser Gln Glu Gly Leu Lys Leu Ser Asn Asp Met Met Gly
1730 1735 1740

Ser Tyr Ala Glu Met Lys Phe Asp His Thr Asn Ser Leu Asn Ile Ala
1745 1750 1755 1760

Gly Leu Ser Leu Asp Phe Ser Ser Lys Leu Asp Asn Ile Tyr Ser Ser
1765 1770 1775

Asp Lys Phe Tyr Lys Gln Thr Val Asn Leu Gln Leu Gln Pro Tyr Ser
1780 1785 1790

Leu Val Thr Thr Leu Asn Ser Asp Leu Lys Tyr Asn Ala Leu Asp Leu
1795 1800 1805

Thr Asn Asn Gly Lys Leu Arg Leu Glu Pro Leu Lys Leu His Val Ala
1810 1815 1820

Gly Asn Leu Lys Gly Ala Tyr Gln Asn Asn Glu Ile Lys His Ile Tyr
1825 1830 1835 1840

Ala Ile Ser Ser Ala Ala Leu Ser Ala Ser Tyr Lys Ala Asp Thr Val
1845 1850 1855

Ala Lys Val Gln Gly Val Glu Phe Ser His Arg Leu Asn Thr Asp Ile
1860 1865 1870

Ala Gly Leu Ala Ser Ala Ile Asp Met Ser Thr Asn Tyr Asn Ser Asp
1875 1880 1885

Ser Leu His Phe Ser Asn Val Phe Arg Ser Val Met Ala Pro Phe Thr
1890 1895 1900

Met Thr Ile Asp Ala His Thr Asn Gly Asn Gly Lys Leu Ala Leu Trp
1905 1910 1915 1920

Gly Glu His Thr Gly Gln Leu Tyr Ser Lys Phe Leu Leu Lys Ala Glu
1925 1930 1935

Pro Leu Ala Phe Thr Phe Ser His Asp Tyr Lys Gly Ser Thr Ser His
1940 1945 1950

His Leu Val Ser Arg Lys Ser Ile Ser Ala Ala Leu Glu His Lys Val
1955 1960 1965

Ser Ala Leu Leu Thr Pro Ala Glu Gln Thr Gly Thr Trp Lys Leu Lys
1970 1975 1980

Thr Gln Phe Asn Asn Asn Glu Tyr Ser Gln Asp Leu Asp Ala Tyr Asn
1985 1990 1995 2000

Thr Lys Asp Lys Ile Gly Val Glu Leu Thr Gly Arg Thr Leu Ala Asp
2005 2010 2015

Leu Thr Leu Leu Asp Ser Pro Ile Lys Val Pro Leu Leu Leu Ser Glu
2020 2025 2030

Pro Ile Asn Ile Ile Asp Ala Leu Glu Met Arg Asp Ala Val Glu Lys
2035 2040 2045

Pro Gln Glu Phe Thr Ile Val Ala Phe Val Lys Tyr Asp Lys Asn Gln
2050 2055 2060

Asp Val His Ser Ile Asn Leu Pro Phe Phe Glu Thr Leu Gln Glu Tyr
2065 2070 2075 2080

Phe Glu Arg Asn Arg Gln Thr Ile Ile Val Val Val Glu Asn Val Gln
2085 2090 2095

Arg Asn Leu Lys His Ile Asn Ile Asp Gln Phe Val Arg Lys Tyr Arg
2100 2105 2110

Ala Ala Leu Gly Lys Leu Pro Gln Gln Ala Asn Asp Tyr Leu Asn Ser
2115 2120 2125

Phe Asn Trp Glu Arg Gln Val Ser His Ala Lys Glu Lys Leu Thr Ala
2130 2135 2140

Leu Thr Lys Lys Tyr Arg Ile Thr Glu Asn Asp Ile Gln Ile Ala Leu
2145 2150 2155 2160

Asp Asp Ala Lys Ile Asn Phe Asn Glu Lys Leu Ser Gln Leu Gln Thr
2165 2170 2175

Tyr Met Ile Gln Phe Asp Gln Tyr Ile Lys Asp Ser Tyr Asp Leu His
2180 2185 2190

Asp Leu Lys Ile Ala Ile Ala Asn Ile Ile Asp Glu Ile Ile Glu Lys
2195 2200 2205

Leu Lys Ser Leu Asp Glu His Tyr His Ile Arg Val Asn Leu Val Lys
2210 2215 2220

Thr Ile His Asp Leu His Leu Phe Ile Glu Asn Ile Asp Phe Asn Lys
2225 2230 2235 2240

Ser Gly Ser Ser Thr Ala Ser Trp Ile Gln Asn Val Asp Thr Lys Tyr
2245 2250 2255

Gln Ile Arg Ile Gln Ile Gln Glu Lys Leu Gln Gln Leu Lys Arg His
2260 2265 2270

Ile Gln Asn Ile Asp Ile Gln His Leu Ala Gly Lys Leu Lys Gln His
2275 2280 2285

Ile Glu Ala Ile Asp Val Arg Val Leu Leu Asp Gln Leu Gly Thr Thr
2290 2295 2300

Ile Ser Phe Glu Arg Ile Asn Asp Val Leu Glu His Val Lys His Phe
2305 2310 2315 2320

Val Ile Asn Leu Ile Gly Asp Phe Glu Val Ala Glu Lys Ile Asn Ala
2325 2330 2335

Phe Arg Ala Lys Val His Glu Leu Ile Glu Arg Tyr Glu Val Asp Gln
2340 2345 2350

Gln Ile Gln Val Leu Met Asp Lys Leu Val Glu Leu Thr His Gln Tyr
2355 2360 2365

Lys Leu Lys Glu Thr Ile Gln Lys Leu Ser Asn Val Leu Gln Gln Val
2370 2375 2380

Lys Ile Lys Asp Tyr Phe Glu Lys Leu Val Gly Phe Ile Asp Asp Ala
2385 2390 2395 2400

Val Lys Lys Leu Asn Glu Leu Ser Phe Lys Thr Phe Ile Glu Asp Val
2405 2410 2415

Asn Lys Phe Leu Asp Met Leu Ile Lys Lys Leu Lys Ser Phe Asp Tyr
2420 2425 2430

His Gln Phe Val Asp Glu Thr Asn Asp Lys Ile Arg Glu Val Thr Gln
2435 2440 2445

Arg Leu Asn Gly Glu Ile Gln Ala Leu Glu Leu Pro Gln Lys Ala Glu
2450 2455 2460

Ala Leu Lys Leu Phe Leu Glu Glu Thr Lys Ala Thr Val Ala Val Tyr
2465 2470 2475 2480

Leu Glu Ser Leu Gln Asp Thr Lys Ile Thr Leu Ile Ile Asn Trp Leu
2485 2490 2495

Gln Glu Ala Leu Ser Ser Ala Ser Leu Ala His Met Lys Ala Lys Phe
2500 2505 2510

Arg Glu Thr Leu Glu Asp Thr Arg Asp Arg Met Tyr Gln Met Asp Ile
2515 2520 2525

Gln Gln Glu Leu Gln Arg Tyr Leu Ser Leu Val Gly Gln Val Tyr Ser
2530 2535 2540

Thr Leu Val Thr Tyr Ile Ser Asp Trp Trp Thr Leu Ala Ala Lys Asn
2545 2550 2555 2560

Leu Thr Asp Phe Ala Glu Gln Tyr Ser Ile Gln Asp Trp Ala Lys Arg
2565 2570 2575

Met Lys Ala Leu Val Glu Gln Gly Phe Thr Val Pro Glu Ile Lys Thr
2580 2585 2590

Ile Leu Gly Thr Met Pro Ala Phe Glu Val Ser Leu Gln Ala Leu Gln
2595 2600 2605

Lys Ala Thr Phe Gln Thr Pro Asp Phe Ile Val Pro Leu Thr Asp Leu
2610 2615 2620

Arg Ile Pro Ser Val Gln Ile Asn Phe Lys Asp Leu Lys Asn Ile Lys
2625 2630 2635 2640

Ile Pro Ser Arg Phe Ser Thr Pro Glu Phe Thr Ile Leu Asn Thr Phe
2645 2650 2655

His Ile Pro Ser Phe Thr Ile Asp Phe Val Glu Met Lys Val Lys Ile
2660 2665 2670

Ile Arg Thr Ile Asp Gln Met Gln Asn Ser Glu Leu Gln Trp Pro Val
2675 2680 2685

Pro Asp Ile Tyr Leu Arg Asp Leu Lys Val Glu Asp Ile Pro Leu Ala
2690 2695 2700

Arg Ile Thr Leu Pro Asp Phe Arg Leu Pro Glu Ile Ala Ile Pro Glu
2705 2710 2715 2720

Phe Ile Ile Pro Thr Leu Asn Leu Asn Asp Phe Gln Val Pro Asp Leu
2725 2730 2735

His Ile Pro Glu Phe Gln Leu Pro His Ile Ser His Thr Ile Glu Val
2740 2745 2750

Pro Thr Phe Gly Lys Leu Tyr Ser Ile Leu Lys Ile Gln Ser Pro Leu
2755 2760 2765

Phe Thr Leu Asp Ala Asn Ala Asp Ile Gly Asn Gly Thr Thr Ser Ala
2770 2775 2780

Asn Glu Ala Gly Ile Ala Ala Ser Ile Thr Ala Lys Gly Glu Ser Lys
2785 2790 2795 2800

Leu Glu Val Leu Asn Phe Asp Phe Gln Ala Asn Ala Gln Leu Ser Asn
2805 2810 2815

Pro Lys Ile Asn Pro Leu Ala Leu Lys Glu Ser Val Lys Phe Ser Ser
2820 2825 2830

Lys Tyr Leu Arg Thr Glu His Gly Ser Glu Met Leu Phe Phe Gly Asn
2835 2840 2845

Ala Ile Glu Gly Lys Ser Asn Thr Val Ala Ser Leu His Thr Glu Lys
2850 2855 2860

Asn Thr Leu Glu Leu Ser Asn Gly Val Ile Val Lys Ile Asn Asn Gln
2865 2870 2875 2880

Leu Thr Leu Asp Ser Asn Thr Lys Tyr Phe His Lys Leu Asn Ile Pro
2885 2890 2895

Lys Leu Asp Phe Ser Ser Gln Ala Asp Leu Arg Asn Glu Ile Lys Thr
2900 2905 2910

Leu Leu Lys Ala Gly His Ile Ala Trp Thr Ser Ser Gly Lys Gly Ser
2915 2920 2925

Trp Lys Trp Ala Cys Pro Arg Phe Ser Asp Glu Gly Thr His Glu Ser
2930 2935 2940

Gln Ile Ser Phe Thr Ile Glu Gly Pro Leu Thr Ser Phe Gly Leu Ser
2945 2950 2955 2960

Asn Lys Ile Asn Ser Lys His Leu Arg Val Asn Gln Asn Leu Val Tyr
2965 2970 2975

Glu Ser Gly Ser Leu Asn Phe Ser Lys Leu Glu Ile Gln Ser Gln Val
2980 2985 2990

Asp Ser Gln His Val Gly His Ser Val Leu Thr Ala Lys Gly Met Ala
2995 3000 3005

Leu Phe Gly Glu Gly Lys Ala Glu Phe Thr Gly Arg His Asp Ala His
3010 3015 3020

Leu Asn Gly Lys Val Ile Gly Thr Leu Lys Asn Ser Leu Phe Phe Ser
3025 3030 3035 3040

Ala Gln Pro Phe Glu Ile Thr Ala Ser Thr Asn Asn Glu Gly Asn Leu
3045 3050 3055

Lys Val Arg Phe Pro Leu Arg Leu Thr Gly Lys Ile Asp Phe Leu Asn
3060 3065 3070

Asn Tyr Ala Leu Phe Leu Ser Pro Ser Ala Gln Gln Ala Ser Trp Gln
3075 3080 3085

Val Ser Ala Arg Phe Asn Gln Tyr Lys Tyr Asn Gln Asn Phe Ser Ala
3090 3095 3100

Gly Asn Asn Glu Asn Ile Met Glu Ala His Val Gly Ile Asn Gly Glu
3105 3110 3115 3120

Ala Asn Leu Asp Phe Leu Asn Ile Pro Leu Thr Ile Pro Glu Met Arg
3125 3130 3135

Leu Pro Tyr Thr Ile Ile Thr Thr Pro Pro Leu Lys Asp Phe Ser Leu
3140 3145 3150

Trp Glu Lys Thr Gly Leu Lys Glu Phe Leu Lys Thr Thr Lys Gln Ser
3155 3160 3165

Phe Asp Leu Ser Val Lys Ala Gln Tyr Lys Lys Asn Lys His Arg His
3170 3175 3180

Ser Ile Thr Asn Pro Leu Ala Val Leu Cys Glu Phe Ile Ser Gln Ser
3185 3190 3195 3200

Ile Lys Ser Phe Asp Arg His Phe Glu Lys Asn Arg Asn Asn Ala Leu
3205 3210 3215

Asp Phe Val Thr Lys Ser Tyr Asn Glu Thr Lys Ile Lys Phe Asp Lys
3220 3225 3230

Tyr Lys Ala Glu Lys Ser His Asp Glu Leu Pro Arg Thr Phe Gln Ile
3235 3240 3245

Pro Gly Tyr Thr Val Pro Val Val Asn Val Glu Val Ser Pro Phe Thr
3250 3255 3260

Ile Glu Met Ser Ala Phe Gly Tyr Val Phe Pro Lys Ala Val Ser Met
3265 3270 3275 3280

Pro Ser Phe Ser Ile Leu Gly Ser Asp Val Arg Val Pro Ser Tyr Thr
3285 3290 3295

Leu Ile Leu Pro Ser Leu Glu Leu Pro Val Leu His Val Pro Arg Asn
3300 3305 3310

Leu Lys Leu Ser Leu Pro His Phe Lys Glu Leu Cys Thr Ile Ser His
3315 3320 3325

Ile Phe Ile Pro Ala Met Gly Asn Ile Thr Tyr Asp Phe Ser Phe Lys
3330 3335 3340

Ser Ser Val Ile Thr Leu Asn Thr Asn Ala Glu Leu Phe Asn Gln Ser
3345 3350 3355 3360

Asp Ile Val Ala His Leu Leu Ser Ser Ser Ser Ser Val Ile Asp Ala
3365 3370 3375

Leu Gln Tyr Lys Leu Glu Gly Thr Thr Arg Leu Thr Arg Lys Arg Gly
3380 3385 3390

Leu Lys Leu Ala Thr Ala Leu Ser Leu Ser Asn Lys Phe Val Glu Gly
3395 3400 3405

Ser His Asn Ser Thr Val Ser Leu Thr Thr Lys Asn Met Glu Val Ser
3410 3415 3420

Val Ala Lys Thr Thr Lys Ala Glu Ile Pro Ile Leu Arg Met Asn Phe
3425 3430 3435 3440

Lys Gln Glu Leu Asn Gly Asn Thr Lys Ser Lys Pro Thr Val Ser Ser
3445 3450 3455

Ser Met Glu Phe Lys Tyr Asp Phe Asn Ser Ser Met Leu Tyr Ser Thr
3460 3465 3470

Ala Lys Gly Ala Val Asp His Lys Leu Ser Leu Glu Ser Leu Thr Ser
3475 3480 3485

Tyr Phe Ser Ile Glu Ser Ser Thr Lys Gly Asp Val Lys Gly Ser Val
3490 3495 3500

Leu Ser Arg Glu Tyr Ser Gly Thr Ile Ala Ser Glu Ala Asn Thr Tyr
3505 3510 3515 3520

Leu Asn Ser Lys Ser Thr Arg Ser Ser Val Lys Leu Gln Gly Thr Ser
3525 3530 3535

Lys Ile Asp Asp Ile Trp Asn Leu Glu Val Lys Glu Asn Phe Ala Gly
3540 3545 3550

Glu Ala Thr Leu Gln Arg Ile Tyr Ser Leu Trp Glu His Ser Thr Lys
3555 3560 3565

Asn His Leu Gln Leu Glu Gly Leu Phe Phe Thr Asn Gly Glu His Thr
3570 3575 3580

Ser Lys Ala Thr Leu Glu Leu Ser Pro Trp Gln Met Ser Ala Leu Val
3585 3590 3595 3600

Gln Val His Ala Ser Gln Pro Ser Ser Phe His Asp Phe Pro Asp Leu
3605 3610 3615

Gly Gln Glu Val Ala Leu Asn Ala Asn Thr Lys Asn Gln Lys Ile Arg
3620 3625 3630

Trp Lys Asn Glu Val Arg Ile His Ser Gly Ser Phe Gln Ser Gln Val
3635 3640 3645

Glu Leu Ser Asn Asp Gln Glu Lys Ala His Leu Asp Ile Ala Gly Ser
3650 3655 3660

Leu Glu Gly His Leu Arg Phe Leu Lys Asn Ile Ile Leu Pro Val Tyr
3665 3670 3675 3680

Asp Lys Ser Leu Trp Asp Phe Leu Lys Leu Asp Val Thr Thr Ser Ile
3685 3690 3695

Gly Arg Arg Gln His Leu Arg Val Ser Thr Ala Phe Val Tyr Thr Lys
3700 3705 3710

Asn Pro Asn Gly Tyr Ser Phe Ser Ile Pro Val Lys Val Leu Ala Asp
3715 3720 3725

Lys Phe Ile Thr Pro Gly Leu Lys Leu Asn Asp Leu Asn Ser Val Leu
3730 3735 3740

Val Met Pro Thr Phe His Val Pro Phe Thr Asp Leu Gln Val Pro Ser
3745 3750 3755 3760

Cys Lys Leu Asp Phe Arg Glu Ile Gln Ile Tyr Lys Lys Leu Arg Thr
3765 3770 3775

Ser Ser Phe Ala Leu Asn Leu Pro Thr Leu Pro Glu Val Lys Phe Pro
3780 3785 3790

Glu Val Asp Val Leu Thr Lys Tyr Ser Gln Pro Glu Asp Ser Leu Ile
3795 3800 3805

Pro Phe Phe Glu Ile Thr Val Pro Glu Ser Gln Leu Thr Val Ser Gln
3810 3815 3820

Phe Thr Leu Pro Lys Ser Val Ser Asp Gly Ile Ala Ala Leu Asp Leu
3825 3830 3835 3840

Asn Ala Val Ala Asn Lys Ile Ala Asp Phe Glu Leu Pro Thr Ile Ile
3845 3850 3855

Val Pro Glu Gln Thr Ile Glu Ile Pro Ser Ile Lys Phe Ser Val Pro
3860 3865 3870

Ala Gly Ile Val Ile Pro Ser Phe Gln Ala Leu Thr Ala Arg Phe Glu
3875 3880 3885

Val Asp Ser Pro Val Tyr Asn Ala Thr Trp Ser Ala Ser Leu Lys Asn
3890 3895 3900

Lys Ala Asp Tyr Val Glu Thr Val Leu Asp Ser Thr Cys Ser Ser Thr
3905 3910 3915 3920

Val Gln Phe Leu Glu Tyr Glu Leu Asn Val Leu Gly Thr His Lys Ile
3925 3930 3935

Glu Asp Gly Thr Leu Ala Ser Lys Thr Lys Gly Thr Leu Ala His Arg
3940 3945 3950

Asp Phe Ser Ala Glu Tyr Glu Glu Asp Gly Lys Phe Glu Gly Leu Gln
3955 3960 3965

Glu Trp Glu Gly Lys Ala His Leu Asn Ile Lys Ser Pro Ala Phe Thr
3970 3975 3980

Asp Leu His Leu Arg Tyr Gln Lys Asp Lys Lys Gly Ile Ser Thr Ser
3985 3990 3995 4000

Ala Ala Ser Pro Ala Val Gly Thr Val Gly Met Asp Met Asp Glu Asp
4005 4010 4015

Asp Asp Phe Ser Lys Trp Asn Phe Tyr Tyr Ser Pro Gln Ser Ser Pro
4020 4025 4030

Asp Lys Lys Leu Thr Ile Phe Lys Thr Glu Leu Arg Val Arg Glu Ser
4035 4040 4045

Asp Glu Glu Thr Gln Ile Lys Val Asn Trp Glu Glu Glu Ala Ala Ser
4050 4055 4060

Gly Leu Leu Thr Ser Leu Lys Asp Asn Val Pro Lys Ala Thr Gly Val
4065 4070 4075 4080

Leu Tyr Asp Tyr Val Asn Lys Tyr His Trp Glu His Thr Gly Leu Thr
4085 4090 4095

Leu Arg Glu Val Ser Ser Lys Leu Arg Arg Asn Leu Gln Asn Asn Ala
4100 4105 4110

Glu Trp Val Tyr Gln Gly Ala Ile Arg Gln Ile Asp Asp Ile Asp Val
4115 4120 4125

Arg Phe Gln Lys Ala Ala Ser Gly Thr Thr Gly Thr Tyr Gln Glu Trp
4130 4135 4140

Lys Asp Lys Ala Gln Asn Leu Tyr Gln Glu Leu Leu Thr Gln Glu Gly
4145 4150 4155 4160

Gln Ala Ser Phe Gln Gly Leu Lys Asp Asn Val Phe Asp Gly Leu Val
4165 4170 4175

Arg Val Thr Gln Lys Phe His Met Lys Val Lys His Leu Ile Asp Ser
4180 4185 4190

Leu Ile Asp Phe Leu Asn Phe Pro Arg Phe Gln Phe Pro Gly Lys Pro
4195 4200 4205

Gly Ile Tyr Thr Arg Glu Glu Leu Cys Thr Met Phe Ile Arg Glu Val
4210 4215 4220

Gly Thr Val Leu Ser Gln Val Tyr Ser Lys Val His Asn Gly Ser Glu
4225 4230 4235 4240

Ile Leu Phe Ser Tyr Phe Gln Asp Leu Val Ile Thr Leu Pro Phe Glu
4245 4250 4255

Leu Arg Lys His Lys Leu Ile Asp Val Ile Ser Met Tyr Arg Glu Leu
4260 4265 4270

Leu Lys Asp Leu Ser Lys Glu Ala Gln Glu Val Phe Lys Ala Ile Gln
4275 4280 4285

Ser Leu Lys Thr Thr Glu Val Leu Arg Asn Leu Gln Asp Leu Leu Gln
4290 4295 4300

Phe Ile Phe Gln Leu Ile Glu Asp Asn Ile Lys Gln Leu Lys Glu Met
4305 4310 4315 4320

Lys Phe Thr Tyr Leu Ile Asn Tyr Ile Gln Asp Glu Ile Asn Thr Ile
4325 4330 4335

Phe Asn Asp Tyr Ile Pro Tyr Val Phe Lys Leu Leu Lys Glu Asn Leu
4340 4345 4350

Cys Leu Asn Leu His Lys Phe Asn Glu Phe Ile Gln Asn Glu Leu Gln
4355 4360 4365

Glu Ala Ser Gln Glu Leu Gln Gln Ile His Gln Tyr Ile Met Ala Leu
4370 4375 4380

Arg Glu Glu Tyr Phe Asp Pro Ser Ile Val Gly Trp Thr Val Lys Tyr
4385 4390 4395 4400

Tyr Glu Leu Glu Glu Lys Ile Val Ser Leu Ile Lys Asn Leu Leu Val
4405 4410 4415

Ala Leu Lys Asp Phe His Ser Glu Tyr Ile Val Ser Ala Ser Asn Phe
4420 4425 4430

Thr Ser Gln Leu Ser Ser Gln Val Glu Gln Phe Leu His Arg Asn Ile
4435 4440 4445

Gln Glu Tyr Leu Ser Ile Leu Thr Asp Pro Asp Gly Lys Gly Lys Glu
4450 4455 4460

Lys Ile Ala Glu Leu Ser Ala Thr Ala Gln Glu Ile Ile Lys Ser Gln
4465 4470 4475 4480

Ala Ile Ala Thr Lys Lys Ile Ile Ser Asp Tyr His Gln Gln Phe Arg
4485 4490 4495

Tyr Lys Leu Gln Asp Phe Ser Asp Gln Leu Ser Asp Tyr Tyr Glu Lys
4500 4505 4510

Phe Ile Ala Glu Ser Lys Arg Leu Ile Asp Leu Ser Ile Gln Asn Tyr
4515 4520 4525

His Thr Phe Leu Ile Tyr Ile Thr Glu Leu Leu Lys Lys Leu Gln Ser
4530 4535 4540

Thr Thr Val Met Asn Pro Tyr Met Lys Leu Ala Pro Gly Glu Leu Thr
4545 4550 4555 4560

Ile Ile Leu


<210> 33 
<211> 2196 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> (13)...(1983)
<223> Nucleotide sequence encoding
5,10-methylenetetrahydrofolate reductase (MTHFR)

<400> 33 

aattccggag cc atg gtg aac gaa gcc aga gga aac agc agc ctc aac ccc 51
Met Val Asn Glu Ala Arg Gly Asn Ser Ser Leu Asn Pro
1 5 10

tgc ttg gag ggc agt gcc agc agt ggc agt gag agc tcc aaa gat agt 99
Cys Leu Glu Gly Ser Ala Ser Ser Gly Ser Glu Ser Ser Lys Asp Ser
15 20 25

tcg aga tgt tcc acc ccg ggc ctg gac cct gag cgg cat gag aga ctc 147
Ser Arg Cys Ser Thr Pro Gly Leu Asp Pro Glu Arg His Glu Arg Leu
30 35 40 45

cgg gag aag atg agg cgg cga ttg gaa tct ggt gac aag tgg ttc tcc 195
Arg Glu Lys Met Arg Arg Arg Leu Glu Ser Gly Asp Lys Trp Phe Ser
50 55 60

ctg gaa ttc ttc cct cct cga act gct gag gga gct gtc aat ctc atc 243
Leu Glu Phe Phe Pro Pro Arg Thr Ala Glu Gly Ala Val Asn Leu Ile
65 70 75

tca agg ttt gac cgg atg gca gca ggt ggc ccc ctc tac ata gac gtg 291
Ser Arg Phe Asp Arg Met Ala Ala Gly Gly Pro Leu Tyr Ile Asp Val
80 85 90

acc tgg cac cca gca ggt gac cct ggc tca gac aag gag acc tcc tcc 339
Thr Trp His Pro Ala Gly Asp Pro Gly Ser Asp Lys Glu Thr Ser Ser
95 100 105

atg atg atc gcc agc acc gcc gtg aac tac tgt ggc ctg gag acc atc 387
Met Met Ile Ala Ser Thr Ala Val Asn Tyr Cys Gly Leu Glu Thr Ile
110 115 120 125

ctg cac atg acc tgc tgc cgt cag cgc ctg gag gag atc acg ggc cat 435
Leu His Met Thr Cys Cys Arg Gln Arg Leu Glu Glu Ile Thr Gly His
130 135 140

ctg cac aaa gct aag cag ctg ggc ctg aag aac atc atg gcg ctg cgg 483
Leu His Lys Ala Lys Gln Leu Gly Leu Lys Asn Ile Met Ala Leu Arg
145 150 155

gga gac cca ata ggt gac cag tgg gaa gag gag gag gga ggc ttc aac 531
Gly Asp Pro Ile Gly Asp Gln Trp Glu Glu Glu Glu Gly Gly Phe Asn
160 165 170

tac gca gtg gac ctg gtg aag cac atc cga agt gag ttt ggt gac tac 579
Tyr Ala Val Asp Leu Val Lys His Ile Arg Ser Glu Phe Gly Asp Tyr
175 180 185

ttt gac atc tgt gtg gca ggt tac ccc aaa ggc cac ccc gaa gca ggg 627
Phe Asp Ile Cys Val Ala Gly Tyr Pro Lys Gly His Pro Glu Ala Gly
190 195 200 205

agc ttt gag gct gac ctg aag cac ttg aag gag aag gtg tct gcg gga 675
Ser Phe Glu Ala Asp Leu Lys His Leu Lys Glu Lys Val Ser Ala Gly
210 215 220

gcc gat ttc atc atc acg cag ctt ttc ttt gag gct gac aca ttc ttc 723
Ala Asp Phe Ile Ile Thr Gln Leu Phe Phe Glu Ala Asp Thr Phe Phe
225 230 235

cgc ttt gtg aag gca tgc acc gac atg ggc atc act tgc ccc atc gtc 771
Arg Phe Val Lys Ala Cys Thr Asp Met Gly Ile Thr Cys Pro Ile Val
240 245 250

ccc ggg atc ttt ccc atc cag ggc tac cac tcc ctt cgg cag ctt gtg 819
Pro Gly Ile Phe Pro Ile Gln Gly Tyr His Ser Leu Arg Gln Leu Val
255 260 265

aag ctg tcc aag ctg gag gtg cca cag gag atc aag gac gtg att gag 867
Lys Leu Ser Lys Leu Glu Val Pro Gln Glu Ile Lys Asp Val Ile Glu
270 275 280 285

cca atc aaa gac aac gat gct gcc atc cgc aac tat ggc atc gag ctg 915
Pro Ile Lys Asp Asn Asp Ala Ala Ile Arg Asn Tyr Gly Ile Glu Leu
290 295 300

gcc gtg agc ctg tgc cag gag ctt ctg gcc agt ggc ttg gtg cca ggc 963
Ala Val Ser Leu Cys Gln Glu Leu Leu Ala Ser Gly Leu Val Pro Gly
305 310 315

ctc cac ttc tac acc ctc aac cgc gag atg gct acc aca gag gtg ctg 1011
Leu His Phe Tyr Thr Leu Asn Arg Glu Met Ala Thr Thr Glu Val Leu
320 325 330

aag cgc ctg ggg atg tgg act gag gac ccc agg cgt ccc cta ccc tgg 1059
Lys Arg Leu Gly Met Trp Thr Glu Asp Pro Arg Arg Pro Leu Pro Trp
335 340 345

gct ctc agt gcc cac ccc aag cgc cga gag gaa gat gta cgt ccc atc 1107
Ala Leu Ser Ala His Pro Lys Arg Arg Glu Glu Asp Val Arg Pro Ile
350 355 360 365

ttc tgg gcc tcc aga cca aag agt tac atc tac cgt acc cag gag tgg 1155
Phe Trp Ala Ser Arg Pro Lys Ser Tyr Ile Tyr Arg Thr Gln Glu Trp
370 375 380


gac gag ttc cct aac ggc cgc tgg ggc aat tec tct tec cct gec ttt 
Asp Glu Phe Pro Asn Gly Arg Trp Gly Asn Ser Ser Ser Pro Ala Phe 
385 390 395 


ggc ate etc acc ate aac tea cag ccc aac ate aac ggg aag ccg tec 
Gly He Leu Thr He Asn Ser Gin Pro Asn He Asn Gly Lys Pro Ser 
480 485 490 


1203 


ggg gag ctg aag gac tac tac etc ttc tac ctg aag age aag tec ccc 1251 
Gly Glu Leu Lys Asp Tyr Tyr Leu Phe Tyr Leu Lys Ser Lys Ser Pro 
400 405 410 

aag gag gag ctg ctg aag atg tgg ggg gag gag ctg acc agt gaa gca 1299 
Lys Glu Glu Leu Leu Lys Met Trp Gly Glu Glu Leu Thr Ser Glu Ala 
415 420 425 

agt gtc ttt gaa gtc ttt gtt ctt tac etc teg gga gaa cca aac egg 1347 
Ser Val Phe Glu Val Phe Val Leu Tyr Leu Ser Gly Glu Pro Asn Arg 
430 435 440 445 

aat ggt cac aaa gtg act tgc ctg ccc tgg aac gat gag ccc ctg gcg 1395 
Asn Gly His Lys Val Thr Cys Leu Pro Trp Asn Asp Glu Pro Leu Ala 
450 455 460 

get gag acc age ctg ctg aag gag gag ctg ctg egg gtg aac cgc cag 1443 
Ala Glu Thr Ser Leu Leu Lys Glu Glu Leu Leu Arg Val Asn Arg Gin 
465 470 475 


1491 


tec gac ccc ate gtg ggc tgg ggc ccc age ggg ggc tat gtc ttc cag 1539 

Ser Asp Pro He Val Gly Trp Gly Pro Ser Gly Gly Tyr Val Phe Gin 

495 500 505 

aag gee tac tta gag ttt ttc act tec cgc gag aca gcg gaa gca ctt 1587 

Lys Ala Tyr Leu Glu Phe Phe Thr Ser Arg Glu Thr Ala Glu Ala Leu 
510 515 520 525 

ctg caa gtg ctg aag aag tac gag etc egg gtt aat tac cac ctt gtc 1635 

Leu Gin Val Leu Lys Lys Tyr Glu Leu Arg Val Asn Tyr His Leu Val 
530 535 540 

aat gtg aag ggt gaa aac ate acc aat gee cct gaa ctg cag ccg aat 1683 
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Asn Val Lys Gly Glu Asn He Thr Asn Ala Pro Glu Leu Gin Pro Asn 
545 550 555 

get gtc act tgg ggc ate ttc cct ggg cga gag ate ate cag ccc acc 1731 
Ala Val Thr Trp Gly He Phe Pro Gly Arg Glu He He Gin Pro Thr 
560 565 570 

gta gtg gat ccc gtc age ttc atg ttc tgg aag gac gag gee ttt gee 1779 
Val Val Asp Pro Val Ser Phe Met Phe Trp Lys Asp Glu Ala Phe Ala 
575 580 585 

ctg tgg att gag egg tgg gga aag ctg tat gag gag gag tec ccg tec 1827 
Leu Trp He Glu Arg Trp Gly Lys Leu Tyr Glu Glu Glu Ser Pro Ser 
590 595 600 605 

cgc acc ate ate cag tac ate cac gac aac tac ttc ctg gtc aac ctg 1875 
Arg Thr He He Gin Tyr lie His Asp Asn Tyr Phe Leu Val Asn Leu 
610 615 620 

gtg gac aat gac ttc cca ctg gac aac tgc etc tgg cag gtg gtg gaa 1923 
Val Asp Asn Asp Phe Pro Leu Asp Asn Cys Leu Trp Gin Val Val Glu 
625 630 635 


gac aca ttg gag ctt ctc aac agg ccc acc cag aat gcg aga gaa acg 1971
Asp Thr Leu Glu Leu Leu Asn Arg Pro Thr Gln Asn Ala Arg Glu Thr
640 645 650

gag gct cca tga ccctgcgtcc tgacgccctg cgttggagcc actcctgtcc 2023
Glu Ala Pro *
655

cgccttcctc ctccacagtg ctgcttctct tgggaactcc actctccttc gtgtctctcc 2083

caccccggcc tccactcccc cacctgacaa tggcagctag actggagtga ggcttccagg 2143

ctcttcctgg acctgagtcg gccccacatg ggaacctagt actctctgct cta 2196

<210> 34
<211> 656
<212> PRT
<213> Homo sapien

<400> 34
Met Val Asn Glu Ala Arg Gly Asn Ser Ser Leu Asn Pro Cys Leu Glu
1 5 10 15

Gly Ser Ala Ser Ser Gly Ser Glu Ser Ser Lys Asp Ser Ser Arg Cys
20 25 30

Ser Thr Pro Gly Leu Asp Pro Glu Arg His Glu Arg Leu Arg Glu Lys
35 40 45

Met Arg Arg Arg Leu Glu Ser Gly Asp Lys Trp Phe Ser Leu Glu Phe
50 55 60

Phe Pro Pro Arg Thr Ala Glu Gly Ala Val Asn Leu Ile Ser Arg Phe
65 70 75 80

Asp Arg Met Ala Ala Gly Gly Pro Leu Tyr Ile Asp Val Thr Trp His
85 90 95

Pro Ala Gly Asp Pro Gly Ser Asp Lys Glu Thr Ser Ser Met Met Ile
100 105 110

Ala Ser Thr Ala Val Asn Tyr Cys Gly Leu Glu Thr Ile Leu His Met
115 120 125

Thr Cys Cys Arg Gln Arg Leu Glu Glu Ile Thr Gly His Leu His Lys
130 135 140

WO 02/072604

PCT/US02/06728

-73-


Ala Lys Gln Leu Gly Leu Lys Asn Ile Met Ala Leu Arg Gly Asp Pro
145 150 155 160

Ile Gly Asp Gln Trp Glu Glu Glu Glu Gly Gly Phe Asn Tyr Ala Val
165 170 175

Asp Leu Val Lys His Ile Arg Ser Glu Phe Gly Asp Tyr Phe Asp Ile
180 185 190

Cys Val Ala Gly Tyr Pro Lys Gly His Pro Glu Ala Gly Ser Phe Glu
195 200 205

Ala Asp Leu Lys His Leu Lys Glu Lys Val Ser Ala Gly Ala Asp Phe
210 215 220

Ile Ile Thr Gln Leu Phe Phe Glu Ala Asp Thr Phe Phe Arg Phe Val
225 230 235 240

Lys Ala Cys Thr Asp Met Gly Ile Thr Cys Pro Ile Val Pro Gly Ile
245 250 255

Phe Pro Ile Gln Gly Tyr His Ser Leu Arg Gln Leu Val Lys Leu Ser
260 265 270

Lys Leu Glu Val Pro Gln Glu Ile Lys Asp Val Ile Glu Pro Ile Lys
275 280 285

Asp Asn Asp Ala Ala Ile Arg Asn Tyr Gly Ile Glu Leu Ala Val Ser
290 295 300

Leu Cys Gln Glu Leu Leu Ala Ser Gly Leu Val Pro Gly Leu His Phe
305 310 315 320

Tyr Thr Leu Asn Arg Glu Met Ala Thr Thr Glu Val Leu Lys Arg Leu
325 330 335

Gly Met Trp Thr Glu Asp Pro Arg Arg Pro Leu Pro Trp Ala Leu Ser
340 345 350

Ala His Pro Lys Arg Arg Glu Glu Asp Val Arg Pro Ile Phe Trp Ala
355 360 365

Ser Arg Pro Lys Ser Tyr Ile Tyr Arg Thr Gln Glu Trp Asp Glu Phe
370 375 380

Pro Asn Gly Arg Trp Gly Asn Ser Ser Ser Pro Ala Phe Gly Glu Leu
385 390 395 400

Lys Asp Tyr Tyr Leu Phe Tyr Leu Lys Ser Lys Ser Pro Lys Glu Glu
405 410 415

Leu Leu Lys Met Trp Gly Glu Glu Leu Thr Ser Glu Ala Ser Val Phe
420 425 430

Glu Val Phe Val Leu Tyr Leu Ser Gly Glu Pro Asn Arg Asn Gly His
435 440 445

Lys Val Thr Cys Leu Pro Trp Asn Asp Glu Pro Leu Ala Ala Glu Thr
450 455 460

Ser Leu Leu Lys Glu Glu Leu Leu Arg Val Asn Arg Gln Gly Ile Leu
465 470 475 480

Thr Ile Asn Ser Gln Pro Asn Ile Asn Gly Lys Pro Ser Ser Asp Pro
485 490 495

Ile Val Gly Trp Gly Pro Ser Gly Gly Tyr Val Phe Gln Lys Ala Tyr
500 505 510

Leu Glu Phe Phe Thr Ser Arg Glu Thr Ala Glu Ala Leu Leu Gln Val
515 520 525

Leu Lys Lys Tyr Glu Leu Arg Val Asn Tyr His Leu Val Asn Val Lys
530 535 540

Gly Glu Asn Ile Thr Asn Ala Pro Glu Leu Gln Pro Asn Ala Val Thr
545 550 555 560

Trp Gly Ile Phe Pro Gly Arg Glu Ile Ile Gln Pro Thr Val Val Asp
565 570 575

Pro Val Ser Phe Met Phe Trp Lys Asp Glu Ala Phe Ala Leu Trp Ile
580 585 590

Glu Arg Trp Gly Lys Leu Tyr Glu Glu Glu Ser Pro Ser Arg Thr Ile
595 600 605

Ile Gln Tyr Ile His Asp Asn Tyr Phe Leu Val Asn Leu Val Asp Asn
610 615 620

Asp Phe Pro Leu Asp Asn Cys Leu Trp Gln Val Val Glu Asp Thr Leu
625 630 635 640

Glu Leu Leu Asn Arg Pro Thr Gln Asn Ala Arg Glu Thr Glu Ala Pro
645 650 655

<210> 35
<211> 3834
<212> DNA
<213> Homo sapien

<220>
<221> CDS
<222> (117)...(1949)
<223> Nucleotide sequence encoding selectin E (SELE)

<400> 35
cctgagacag aggcagcagt gatacccacc tgagagatcc tgtgtttgaa caactgcttc 60
ccaaaacgga aagtatttca agcctaaacc tttgggtgaa aagaactctt gaagtc atg 119
Met
1

att gct tca cag ttt ctc tca gct ctc act ttg gtg ctt ctc att aaa 167
Ile Ala Ser Gln Phe Leu Ser Ala Leu Thr Leu Val Leu Leu Ile Lys
5 10 15

gag agt gga gcc tgg tct tac aac acc tcc acg gaa gct atg act tat 215
Glu Ser Gly Ala Trp Ser Tyr Asn Thr Ser Thr Glu Ala Met Thr Tyr
20 25 30

gat gag gcc agt gct tat tgt cag caa agg tac aca cac ctg gtt gca 263
Asp Glu Ala Ser Ala Tyr Cys Gln Gln Arg Tyr Thr His Leu Val Ala
35 40 45

att caa aac aaa gaa gag att gag tac cta aac tcc ata ttg agc tat 311
Ile Gln Asn Lys Glu Glu Ile Glu Tyr Leu Asn Ser Ile Leu Ser Tyr
50 55 60 65

tca cca agt tat tac tgg att gga atc aga aaa gtc aac aat gtg tgg 359
Ser Pro Ser Tyr Tyr Trp Ile Gly Ile Arg Lys Val Asn Asn Val Trp
70 75 80

gtc tgg gta gga acc cag aaa cct ctg aca gaa gaa gcc aag aac tgg 407
Val Trp Val Gly Thr Gln Lys Pro Leu Thr Glu Glu Ala Lys Asn Trp
85 90 95

gct cca ggt gaa ccc aac aat agg caa aaa gat gag gac tgc gtg gag 455
Ala Pro Gly Glu Pro Asn Asn Arg Gln Lys Asp Glu Asp Cys Val Glu
100 105 110

atc tac atc aag aga gaa aaa gat gtg ggc atg tgg aat gat gag agg 503
Ile Tyr Ile Lys Arg Glu Lys Asp Val Gly Met Trp Asn Asp Glu Arg
115 120 125

tgc agc aag aag aag ctt gcc cta tgc tac aca gct gcc tgt acc aat 551
Cys Ser Lys Lys Lys Leu Ala Leu Cys Tyr Thr Ala Ala Cys Thr Asn
130 135 140 145

aca tcc tgc agt ggc cac ggt gaa tgt gta gag acc atc aat aat tac 599
Thr Ser Cys Ser Gly His Gly Glu Cys Val Glu Thr Ile Asn Asn Tyr
150 155 160

act tgc aag tgt gac cct ggc ttc agt gga ctc aag tgt gag caa att 647
Thr Cys Lys Cys Asp Pro Gly Phe Ser Gly Leu Lys Cys Glu Gln Ile
165 170 175

gtg aac tgt aca gcc ctg gaa tcc cct gag cat gga agc ctg gtt tgc 695
Val Asn Cys Thr Ala Leu Glu Ser Pro Glu His Gly Ser Leu Val Cys
180 185 190

agt cac cca ctg gga aac ttc agc tac aat tct tcc tgc tct atc agc 743
Ser His Pro Leu Gly Asn Phe Ser Tyr Asn Ser Ser Cys Ser Ile Ser
195 200 205

tgt gat agg ggt tac ctg cca agc agc atg gag acc atg cag tgt atg 791
Cys Asp Arg Gly Tyr Leu Pro Ser Ser Met Glu Thr Met Gln Cys Met
210 215 220 225

tcc tct gga gaa tgg agt gct cct att cca gcc tgc aat gtg gtt gag 839
Ser Ser Gly Glu Trp Ser Ala Pro Ile Pro Ala Cys Asn Val Val Glu
230 235 240

tgt gat gct gtg aca aat cca gcc aat ggg ttc gtg gaa tgt ttc caa 887
Cys Asp Ala Val Thr Asn Pro Ala Asn Gly Phe Val Glu Cys Phe Gln
245 250 255

aac cct gga agc ttc cca tgg aac aca acc tgt aca ttt gac tgt gaa 935
Asn Pro Gly Ser Phe Pro Trp Asn Thr Thr Cys Thr Phe Asp Cys Glu
260 265 270

gaa gga ttt gaa cta atg gga gcc cag agc ctt cag tgt acc tca tct 983
Glu Gly Phe Glu Leu Met Gly Ala Gln Ser Leu Gln Cys Thr Ser Ser
275 280 285

ggg aat tgg gac aac gag aag cca acg tgt aaa gct gtg aca tgc agg 1031
Gly Asn Trp Asp Asn Glu Lys Pro Thr Cys Lys Ala Val Thr Cys Arg
290 295 300 305

gcc gtc cgc cag cct cag aat ggc tct gtg agg tgc agc cat tcc cct 1079
Ala Val Arg Gln Pro Gln Asn Gly Ser Val Arg Cys Ser His Ser Pro
310 315 320

gct gga gag ttc acc ttc aaa tca tcc tgc aac ttc acc tgt gag gaa 1127
Ala Gly Glu Phe Thr Phe Lys Ser Ser Cys Asn Phe Thr Cys Glu Glu
325 330 335

ggc ttc atg ttg cag gga cca gcc cag gtt gaa tgc acc act caa ggg 1175
Gly Phe Met Leu Gln Gly Pro Ala Gln Val Glu Cys Thr Thr Gln Gly
340 345 350

cag tgg aca cag caa atc cca gtt tgt gaa gct ttc cag tgc aca gcc 1223
Gln Trp Thr Gln Gln Ile Pro Val Cys Glu Ala Phe Gln Cys Thr Ala
355 360 365

ttg tcc aac ccc gag cga ggc tac atg aat tgt ctt cct agt gct tct 1271
Leu Ser Asn Pro Glu Arg Gly Tyr Met Asn Cys Leu Pro Ser Ala Ser
370 375 380 385

ggc agt ttc cgt tat ggg tcc agc tgt gag ttc tcc tgt gag cag ggt 1319
Gly Ser Phe Arg Tyr Gly Ser Ser Cys Glu Phe Ser Cys Glu Gln Gly
390 395 400

ttt gtg ttg aag gga tcc aaa agg ctc caa tgt ggc ccc aca ggg gag 1367
Phe Val Leu Lys Gly Ser Lys Arg Leu Gln Cys Gly Pro Thr Gly Glu
405 410 415

tgg gac aac gag aag ccc aca tgt gaa gct gtg aga tgc gat gct gtc 1415
Trp Asp Asn Glu Lys Pro Thr Cys Glu Ala Val Arg Cys Asp Ala Val
420 425 430

cac cag ccc ccg aag ggt ttg gtg agg tgt gct cat tcc cct att gga 1463
His Gln Pro Pro Lys Gly Leu Val Arg Cys Ala His Ser Pro Ile Gly
435 440 445

gaa ttc acc tac aag tcc tct tgt gcc ttc agc tgt gag gag gga ttt 1511
Glu Phe Thr Tyr Lys Ser Ser Cys Ala Phe Ser Cys Glu Glu Gly Phe
450 455 460 465

gaa tta tat gga tca act caa ctt gag tgc aca tct cag gga caa tgg 1559
Glu Leu Tyr Gly Ser Thr Gln Leu Glu Cys Thr Ser Gln Gly Gln Trp
470 475 480

aca gaa gag gtt cct tcc tgc caa gtg gta aaa tgt tca agc ctg gca 1607
Thr Glu Glu Val Pro Ser Cys Gln Val Val Lys Cys Ser Ser Leu Ala
485 490 495

gtt ccg gga aag atc aac atg agc tgc agt ggg gag ccc gtg ttt ggc 1655
Val Pro Gly Lys Ile Asn Met Ser Cys Ser Gly Glu Pro Val Phe Gly
500 505 510

act gtg tgc aag ttc gcc tgt cct gaa gga tgg acg ctc aat ggc tct 1703
Thr Val Cys Lys Phe Ala Cys Pro Glu Gly Trp Thr Leu Asn Gly Ser
515 520 525

gca gct cgg aca tgt gga gcc aca gga cac tgg tct ggc ctg cta cct 1751
Ala Ala Arg Thr Cys Gly Ala Thr Gly His Trp Ser Gly Leu Leu Pro
530 535 540 545

acc tgt gaa gct ccc act gag tcc aac att ccc ttg gta gct gga ctt 1799
Thr Cys Glu Ala Pro Thr Glu Ser Asn Ile Pro Leu Val Ala Gly Leu
550 555 560

tct gct gct gga ctc tcc ctc ctg aca tta gca cca ttt ctc ctc tgg 1847
Ser Ala Ala Gly Leu Ser Leu Leu Thr Leu Ala Pro Phe Leu Leu Trp
565 570 575

ctt cgg aaa tgc tta cgg aaa gca aag aaa ttt gtt cct gcc agc agc 1895
Leu Arg Lys Cys Leu Arg Lys Ala Lys Lys Phe Val Pro Ala Ser Ser
580 585 590

tgc caa agc ctt gaa tca gac gga agc tac caa aag cct tct tac atc 1943
Cys Gln Ser Leu Glu Ser Asp Gly Ser Tyr Gln Lys Pro Ser Tyr Ile
595 600 605


ctt taa gttcaaaaga atcagaaaca ggtgcatctg gggaactaga gggatacact 1999
Leu *
610

gaagttaaca gagacagata actctcctcg ggtctctggc ccttcttgcc tactatgcca 2059
gatgccttta tggctgaaac cgcaacaccc atcaccactt caatagatca aagtccagca 2119
ggcaaggacg gccttcaact gaaaagactc agtgttccct ttcctactct caggatcaag 2179
aaagtgttgg ctaatgaagg gaaaggatat tttcttccaa gcaaaggtga agagaccaag 2239
actctgaaat ctcagaattc cttttctaac tctcccttgc tcgctgtaaa atcttggcac 2299
agaaacacaa tattttgtgg ctttctttct tttgcccttc acagtgtttc gacagctgat 2359
tacacagttg ctgtcataag aatgaataat aattatccag agtttagagg aaaaaaatga 2419
ctaaaaatat tataacttaa aaaaatgaca gatgttgaat gcccacaggc aaatgcatgg 2479
agggttgtta atggtgcaaa tcctactgaa tgctctgtgc gagggttact atgcacaatt 2539
taatcacttt catccctatg ggattcagtg cttcttaaag agttcttaag gattgtgata 2599
tttttacttg cattgaatat attataatct tccatacttc ttcattcaat acaagtgtgg 2659
tagggactta aaaaacttgt aaatgctgtc aactatgata tggtaaaagt tacttattct 2719
agattacccc ctcattgttt attaacaaat tatgttacat ctgttttaaa tttatttcaa 2779
aaagggaaac tattgtcccc tagcaaggca tgatgttaac cagaataaag ttctgagtgt 2839
ttttactaca gttgtttttt gaaaacatgg tagaattgga gagtaaaaac tgaatggaag 2899
gtttgtatat tgtcagatat tttttcagaa atatgtggtt tccacgatga aaaacttcca 2959
tgaggccaaa cgttttgaac taataaaagc ataaatgcaa acacacaaag gtataatttt 3019
atgaatgtct ttgttggaaa agaatacaga aagatggatg tgctttgcat tcctacaaag 3079
atgtttgtca gatgtgatat gtaaacataa ttcttgtata ttatggaaga ttttaaattc 3139
acaatagaaa ctcaccatgt aaaagagtca tctggtagat ttttaacgaa tgaagatgtc 3199
taatagttat tccctatttg ttttcttctg tatgttaggg tgctctggaa gagaggaatg 3259
cctgtgtgag caagcattta tgtttattta taagcagatt taacaattcc aaaggaatct 3319
ccagttttca gttgatcact ggcaatgaaa aattctcagt cagtaattgc caaagctgct 3379
ctagccttga ggagtgtgag aatcaaaact ctcctacact tccattaact tagcatgtgt 3439
tgaaaaaaaa agtttcagag aagttctggc tgaacactgg caacgacaaa gccaacagtc 3499
aaaacagaga tgtgataagg atcagaacag cagaggttct tttaaagggg cagaaaaact 3559
ctgggaaata agagagaaca actactgtga tcaggctatg tatggaatac agtgttattt 3619
tctttgaaat tgtttaagtg ttgtaaatat ttatgtaaac tgcattagaa attagctgtg 3679
tgaaatacca gtgtggtttg tgtttgagtt ttattgagaa ttttaaatta taacttaaaa 3739
tattttataa tttttaaagt atatatttat ttaagcttat gtcagaccta tttgacataa 3799
cactataaag gttgacaata aatgtgctta tgttt 3834

<210> 36
<211> 610
<212> PRT
<213> Homo sapien

<400> 36
Met Ile Ala Ser Gln Phe Leu Ser Ala Leu Thr Leu Val Leu Leu Ile
1 5 10 15

Lys Glu Ser Gly Ala Trp Ser Tyr Asn Thr Ser Thr Glu Ala Met Thr
20 25 30

Tyr Asp Glu Ala Ser Ala Tyr Cys Gln Gln Arg Tyr Thr His Leu Val
35 40 45

Ala Ile Gln Asn Lys Glu Glu Ile Glu Tyr Leu Asn Ser Ile Leu Ser
50 55 60

Tyr Ser Pro Ser Tyr Tyr Trp Ile Gly Ile Arg Lys Val Asn Asn Val
65 70 75 80

Trp Val Trp Val Gly Thr Gln Lys Pro Leu Thr Glu Glu Ala Lys Asn
85 90 95

Trp Ala Pro Gly Glu Pro Asn Asn Arg Gln Lys Asp Glu Asp Cys Val
100 105 110
Glu Ile Tyr Ile Lys Arg Glu Lys Asp Val Gly Met Trp Asn Asp Glu
115 120 125

Arg Cys Ser Lys Lys Lys Leu Ala Leu Cys Tyr Thr Ala Ala Cys Thr
130 135 140

Asn Thr Ser Cys Ser Gly His Gly Glu Cys Val Glu Thr Ile Asn Asn
145 150 155 160

Tyr Thr Cys Lys Cys Asp Pro Gly Phe Ser Gly Leu Lys Cys Glu Gln
165 170 175

Ile Val Asn Cys Thr Ala Leu Glu Ser Pro Glu His Gly Ser Leu Val
180 185 190

Cys Ser His Pro Leu Gly Asn Phe Ser Tyr Asn Ser Ser Cys Ser Ile
195 200 205

Ser Cys Asp Arg Gly Tyr Leu Pro Ser Ser Met Glu Thr Met Gln Cys
210 215 220

Met Ser Ser Gly Glu Trp Ser Ala Pro Ile Pro Ala Cys Asn Val Val
225 230 235 240

Glu Cys Asp Ala Val Thr Asn Pro Ala Asn Gly Phe Val Glu Cys Phe
245 250 255

Gln Asn Pro Gly Ser Phe Pro Trp Asn Thr Thr Cys Thr Phe Asp Cys
260 265 270

Glu Glu Gly Phe Glu Leu Met Gly Ala Gln Ser Leu Gln Cys Thr Ser
275 280 285

Ser Gly Asn Trp Asp Asn Glu Lys Pro Thr Cys Lys Ala Val Thr Cys
290 295 300

Arg Ala Val Arg Gln Pro Gln Asn Gly Ser Val Arg Cys Ser His Ser
305 310 315 320

Pro Ala Gly Glu Phe Thr Phe Lys Ser Ser Cys Asn Phe Thr Cys Glu
325 330 335

Glu Gly Phe Met Leu Gln Gly Pro Ala Gln Val Glu Cys Thr Thr Gln
340 345 350

Gly Gln Trp Thr Gln Gln Ile Pro Val Cys Glu Ala Phe Gln Cys Thr
355 360 365

Ala Leu Ser Asn Pro Glu Arg Gly Tyr Met Asn Cys Leu Pro Ser Ala
370 375 380

Ser Gly Ser Phe Arg Tyr Gly Ser Ser Cys Glu Phe Ser Cys Glu Gln
385 390 395 400

Gly Phe Val Leu Lys Gly Ser Lys Arg Leu Gln Cys Gly Pro Thr Gly
405 410 415

Glu Trp Asp Asn Glu Lys Pro Thr Cys Glu Ala Val Arg Cys Asp Ala
420 425 430

Val His Gln Pro Pro Lys Gly Leu Val Arg Cys Ala His Ser Pro Ile
435 440 445

Gly Glu Phe Thr Tyr Lys Ser Ser Cys Ala Phe Ser Cys Glu Glu Gly
450 455 460

Phe Glu Leu Tyr Gly Ser Thr Gln Leu Glu Cys Thr Ser Gln Gly Gln
465 470 475 480

Trp Thr Glu Glu Val Pro Ser Cys Gln Val Val Lys Cys Ser Ser Leu
485 490 495

Ala Val Pro Gly Lys Ile Asn Met Ser Cys Ser Gly Glu Pro Val Phe
500 505 510

Gly Thr Val Cys Lys Phe Ala Cys Pro Glu Gly Trp Thr Leu Asn Gly
515 520 525

Ser Ala Ala Arg Thr Cys Gly Ala Thr Gly His Trp Ser Gly Leu Leu
530 535 540

Pro Thr Cys Glu Ala Pro Thr Glu Ser Asn Ile Pro Leu Val Ala Gly
545 550 555 560

Leu Ser Ala Ala Gly Leu Ser Leu Leu Thr Leu Ala Pro Phe Leu Leu
565 570 575

Trp Leu Arg Lys Cys Leu Arg Lys Ala Lys Lys Phe Val Pro Ala Ser
580 585 590

Ser Cys Gln Ser Leu Glu Ser Asp Gly Ser Tyr Gln Lys Pro Ser Tyr
595 600 605

Ile Leu
610

<210> 37
<211> 1922
<212> DNA
<213> Homo sapien

<220>
<221> CDS
<222> (406)...(1428)
<223> Nucleotide sequence encoding nucleotide binding
protein (G Protein), beta polypeptide 3 (GNB3)

<400> 37
ccacaatagg ggcagacctg tccatccttc tctgtgggtc ccctgtacct ttctccccca 60
acaggatcag acccagaggc agctggttgg ggtttgtcga gaagaaggat tatccagatc 120
agtcctttct aatctcagct cctgcctgta ccctcccata ctcaccaaac cctcttcccc 180
accaccctga gctgaggagc acagtttgag gcccccccaa ccccccgccg gtcggggcca 240
ggccaggcca ggccagctcc tctggcagca gagcctgggc aggtgacggg cgggcgcggg 300
cgtcgcagct gagggagtaa ggaggctccc aggaaccgga gctggaaacc cggccgaggt 360
ccagccagag cccaagagcc agagtgaccc ctcgacctgt cagcc atg ggg gag atg 417
Met Gly Glu Met
1

gag caa ctg cgt cag gaa gcg gag cag ctc aag aag cag att gca gat 465
Glu Gln Leu Arg Gln Glu Ala Glu Gln Leu Lys Lys Gln Ile Ala Asp
5 10 15 20

gcc agg aaa gcc tgt gct gac gtt act ctg gca gag ctg gtg tct ggc 513
Ala Arg Lys Ala Cys Ala Asp Val Thr Leu Ala Glu Leu Val Ser Gly
25 30 35

cta gag gtg gtg gga cga gtc cag atg cgg acg cgg cgg acg tta agg 561
Leu Glu Val Val Gly Arg Val Gln Met Arg Thr Arg Arg Thr Leu Arg
40 45 50

gga cac ctg gcc aag att tac gcc atg cac tgg gcc act gat tct aag 609
Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Ala Thr Asp Ser Lys
55 60 65

ctg ctg gta agt gcc tcg caa gat ggg aag ctg atc gtg tgg gac agc 657
Leu Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile Val Trp Asp Ser
70 75 80

tac acc acc aac aag gtg cac gcc atc cca ctg cgc tcc tcc tgg gtc 705
Tyr Thr Thr Asn Lys Val His Ala Ile Pro Leu Arg Ser Ser Trp Val
85 90 95 100

atg acc tgt gcc tat gcc cca tca ggg aac ttt gtg gca tgt ggg ggg 753
Met Thr Cys Ala Tyr Ala Pro Ser Gly Asn Phe Val Ala Cys Gly Gly
105 110 115

ctg gac aac atg tgt tcc atc tac aac ctc aaa tcc cgt gag ggc aat 801
Leu Asp Asn Met Cys Ser Ile Tyr Asn Leu Lys Ser Arg Glu Gly Asn
120 125 130

gtc aag gtc agc cgg gag ctt tct gct cac aca ggt tat ctc tcc tgc 849
Val Lys Val Ser Arg Glu Leu Ser Ala His Thr Gly Tyr Leu Ser Cys
135 140 145

tgc cgc ttc ctg gat gac aac aat att gtg acc agc tcg ggg gac acc 897
Cys Arg Phe Leu Asp Asp Asn Asn Ile Val Thr Ser Ser Gly Asp Thr
150 155 160

acg tgt gcc ttg tgg gac att gag act ggg cag cag aag act gta ttt 945
Thr Cys Ala Leu Trp Asp Ile Glu Thr Gly Gln Gln Lys Thr Val Phe
165 170 175 180

gtg gga cac acg ggt gac tgc atg agc ctg gct gtg tct cct gac ttc 993
Val Gly His Thr Gly Asp Cys Met Ser Leu Ala Val Ser Pro Asp Phe
185 190 195

aat ctc ttc att tcg ggg gcc tgt gat gcc agt gcc aag ctc tgg gat 1041
Asn Leu Phe Ile Ser Gly Ala Cys Asp Ala Ser Ala Lys Leu Trp Asp
200 205 210

gtg cga gag ggg acc tgc cgt cag act ttc act ggc cac gag tcg gac 1089
Val Arg Glu Gly Thr Cys Arg Gln Thr Phe Thr Gly His Glu Ser Asp
215 220 225

atc aac gcc atc tgt ttc ttc ccc aat gga gag gcc atc tgc acg ggc 1137
Ile Asn Ala Ile Cys Phe Phe Pro Asn Gly Glu Ala Ile Cys Thr Gly
230 235 240

tcg gat gac gct tcc tgc cgc ttg ttt gac ctg cgg gca gac cag gag 1185
Ser Asp Asp Ala Ser Cys Arg Leu Phe Asp Leu Arg Ala Asp Gln Glu
245 250 255 260

ctg atc tgc ttc tcc cac gag agc atc atc tgc ggc atc acg tcc gtg 1233
Leu Ile Cys Phe Ser His Glu Ser Ile Ile Cys Gly Ile Thr Ser Val
265 270 275

gcc ttc tcc ctc agt ggc cgc cta cta ttc gct ggc tac gac gac ttc 1281
Ala Phe Ser Leu Ser Gly Arg Leu Leu Phe Ala Gly Tyr Asp Asp Phe
280 285 290

aac tgc aat gtc tgg gac tcc atg aag tct gag cgt gtg ggc atc ctc 1329
Asn Cys Asn Val Trp Asp Ser Met Lys Ser Glu Arg Val Gly Ile Leu
295 300 305

tct ggc cac gat aac agg gtg agc tgc ctg gga gtc aca gct gac ggg 1377
Ser Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Ala Asp Gly
310 315 320

atg gct gtg gcc aca ggt tcc tgg gac agc ttc ctc aaa atc tgg aac 1425
Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Lys Ile Trp Asn
325 330 335 340

tga ggaggctgga gaaagggaag tggaaggcag tgaacacact cagcagcccc 1478
ctgcccgacc ccatctcatt caggtgttct cttctatatt ccgggtgcca ttcccactaa 1538

gctttctcct ttgagggcag tggggagcat gggactgtgc ctttgggagg cagcatcagg 1598 

gacacagggg caaagaactg ccccatctcc tcccatggcc ttccctcccc acagtcctca 1658 

cagcctctcc cttaatgagc aaggacaacc tgcccctccc cagccctttg caggcccagc 1718 

agacttgagt ctgaggcccc aggccctagg attcctcccc cagagccact acctttgtcc 1778 

aggcctgggt ggtatagggc gtttggccct gtgactatgg ctctggcacc actagggtcc 1838 

tggccctctt cttattcatg ctttctcctt tttctacctt tttttctctc ctaagacacc 1898 

tgcaataaag tgtagcaccc tggt 1922 

<210> 38 

<211> 340 

<212> PRT 

<213> Homo sapien 

<400> 38
Met Gly Glu Met Glu Gln Leu Arg Gln Glu Ala Glu Gln Leu Lys Lys
1 5 10 15

Gln Ile Ala Asp Ala Arg Lys Ala Cys Ala Asp Val Thr Leu Ala Glu
20 25 30

Leu Val Ser Gly Leu Glu Val Val Gly Arg Val Gln Met Arg Thr Arg
35 40 45

Arg Thr Leu Arg Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Ala
50 55 60

Thr Asp Ser Lys Leu Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile
65 70 75 80

Val Trp Asp Ser Tyr Thr Thr Asn Lys Val His Ala Ile Pro Leu Arg
85 90 95

Ser Ser Trp Val Met Thr Cys Ala Tyr Ala Pro Ser Gly Asn Phe Val
100 105 110

Ala Cys Gly Gly Leu Asp Asn Met Cys Ser Ile Tyr Asn Leu Lys Ser
115 120 125

Arg Glu Gly Asn Val Lys Val Ser Arg Glu Leu Ser Ala His Thr Gly
130 135 140

Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Asn Ile Val Thr Ser
145 150 155 160

Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp Ile Glu Thr Gly Gln Gln
165 170 175

Lys Thr Val Phe Val Gly His Thr Gly Asp Cys Met Ser Leu Ala Val
180 185 190

Ser Pro Asp Phe Asn Leu Phe Ile Ser Gly Ala Cys Asp Ala Ser Ala
195 200 205

Lys Leu Trp Asp Val Arg Glu Gly Thr Cys Arg Gln Thr Phe Thr Gly
210 215 220

His Glu Ser Asp Ile Asn Ala Ile Cys Phe Phe Pro Asn Gly Glu Ala
225 230 235 240

Ile Cys Thr Gly Ser Asp Asp Ala Ser Cys Arg Leu Phe Asp Leu Arg
245 250 255

Ala Asp Gln Glu Leu Ile Cys Phe Ser His Glu Ser Ile Ile Cys Gly
260 265 270

Ile Thr Ser Val Ala Phe Ser Leu Ser Gly Arg Leu Leu Phe Ala Gly
275 280 285

Tyr Asp Asp Phe Asn Cys Asn Val Trp Asp Ser Met Lys Ser Glu Arg
290 295 300

Val Gly Ile Leu Ser Gly His Asp Asn Arg Val Ser Cys Leu Gly Val
305 310 315 320

Thr Ala Asp Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu
325 330 335

Lys Ile Trp Asn
340

<210> 39 

<211> 2443 

<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (162)...(1253)

<223> Nucleotide sequence encoding angiotensin receptor
2 (AGTR2)

<400> 39

acgtcccagc gtctgagaga acgagtaagc aagaattcaa agcattctgc agcctgaatt 60
ttgaaggagt gtgtttaggc actaagcaag ctgatttatg ataactgctt taaacttcaa 120
caaccaaagg cataagaact aggagctgct gacatttcaa t atg aag ggc aac tcc 176
Met Lys Gly Asn Ser
1 5

acc ctt gcc act act agc aaa aac att acc agc ggt ctt cac ttc ggg 224
Thr Leu Ala Thr Thr Ser Lys Asn Ile Thr Ser Gly Leu His Phe Gly
10 15 20

ctt gtg aac atc tct ggc aac aat gag tct acc ttg aac tgt tca cag 272
Leu Val Asn Ile Ser Gly Asn Asn Glu Ser Thr Leu Asn Cys Ser Gln
25 30 35

aaa cca tca gat aag cat tta gat gca att cct att ctt tac tac att 320
Lys Pro Ser Asp Lys His Leu Asp Ala Ile Pro Ile Leu Tyr Tyr Ile
40 45 50

ata ttt gta att gga ttt ctg gtc aat att gtc gtg gtt aca ctg ttt 368
Ile Phe Val Ile Gly Phe Leu Val Asn Ile Val Val Val Thr Leu Phe
55 60 65

tgt tgt caa aag ggt cct aaa aag gtt tct agc ata tac atc ttc aac 416
Cys Cys Gln Lys Gly Pro Lys Lys Val Ser Ser Ile Tyr Ile Phe Asn
70 75 80 85

ctc gct gtg gct gat tta ctc ctt ttg gct act ctt cct cta tgg gca 464
Leu Ala Val Ala Asp Leu Leu Leu Leu Ala Thr Leu Pro Leu Trp Ala
90 95 100

acc tat tat tct tat aga tat gac tgg ctc ttt gga cct gtg atg tgc 512
Thr Tyr Tyr Ser Tyr Arg Tyr Asp Trp Leu Phe Gly Pro Val Met Cys
105 110 115

aaa gtt ttt ggt tct ttt ctt acc ctg aac atg ttt gca agc att ttt 560
Lys Val Phe Gly Ser Phe Leu Thr Leu Asn Met Phe Ala Ser Ile Phe
120 125 130

ttt atc acc tgc atg agt gtt gat agg tac caa tct gtc atc tac ccc 608
Phe Ile Thr Cys Met Ser Val Asp Arg Tyr Gln Ser Val Ile Tyr Pro
135 140 145

ttt ctg tct caa aga aga aat ccc tgg caa gca tct tat ata gtt ccc 656
Phe Leu Ser Gln Arg Arg Asn Pro Trp Gln Ala Ser Tyr Ile Val Pro
150 155 160 165

ctt gtt tgg tgt atg gcc tgt ttg tcc tca ttg cca aca ttt tat ttt 704
Leu Val Trp Cys Met Ala Cys Leu Ser Ser Leu Pro Thr Phe Tyr Phe
170 175 180

cga gac gtc aga acc att gaa tac tta gga gtg aat gct tgc att atg 752
Arg Asp Val Arg Thr Ile Glu Tyr Leu Gly Val Asn Ala Cys Ile Met
185 190 195

gct ttc cca cct gag aaa tat gcc caa tgg tca gct ggg att gcc tta 800
Ala Phe Pro Pro Glu Lys Tyr Ala Gln Trp Ser Ala Gly Ile Ala Leu
200 205 210

atg aaa aat atc ctt ggt ttt att atc cct tta ata ttc ata gca aca 848
Met Lys Asn Ile Leu Gly Phe Ile Ile Pro Leu Ile Phe Ile Ala Thr
215 220 225

tgc tat ttt gga att aga aaa cac tta ctg aag acg aat agc tat ggg 896
Cys Tyr Phe Gly Ile Arg Lys His Leu Leu Lys Thr Asn Ser Tyr Gly
230 235 240 245

aag aac agg ata acc cgt gac caa gtc ctg aag atg gca gct gct gtt 944
Lys Asn Arg Ile Thr Arg Asp Gln Val Leu Lys Met Ala Ala Ala Val
250 255 260

gtt ctg gcc ttc atc att tgg tgc ctt ccc ttc cat gtt ctg acc ttc 992
Val Leu Ala Phe Ile Ile Trp Cys Leu Pro Phe His Val Leu Thr Phe
265 270 275

ctg gat gct ctg gcc tgg atg ggt gtc att aat agc tgc gaa gtt ata 1040
Leu Asp Ala Leu Ala Trp Met Gly Val Ile Asn Ser Cys Glu Val Ile
280 285 290

gca gtc att gac ctg gca ctt cct ttt gcc atc ctc ttg gga ttc acc 1088
Ala Val Ile Asp Leu Ala Leu Pro Phe Ala Ile Leu Leu Gly Phe Thr
295 300 305

aac agc tgc gtt aat ccg ttt ctg tat tgt ttt gtt gga aac cgg ttc 1136
Asn Ser Cys Val Asn Pro Phe Leu Tyr Cys Phe Val Gly Asn Arg Phe
310 315 320 325

caa cag aag ctc cgc agt gtg ttt agg gtt cca att act tgg ctc caa 1184
Gln Gln Lys Leu Arg Ser Val Phe Arg Val Pro Ile Thr Trp Leu Gln
330 335 340

ggg aaa aga gag agt atg tct tgc cgg aaa agc agt tct ctt aga gaa 1232
Gly Lys Arg Glu Ser Met Ser Cys Arg Lys Ser Ser Ser Leu Arg Glu
345 350 355

atg gag acc ttt gtg tct taa acggagagca aaatgcatgt aatcaacatg 1283
Met Glu Thr Phe Val Ser *
360

gctacttgct ttgaggctca ccagaattat ttttaagtgg ttttaataaa ataataaaat 1343
ttcccctaat cttttctgaa tcttctgaaa ccaaatgtaa ctatgtttat cgtccagtga 1403
ctttcaggaa tgcccattgt tttctgatat gtttgtacaa gatttcattg gtgagacata 1463
tttacaacct agaagtaact ggtgatatat ctcaaattgt aattaataat agattgtgaa 1523
taatgatttg gggattcaga tttctctttg aaacatgctt gtgtttctta gtggggtttt 1583
atatccattt ttatcaggat ttcctcttga accagaacca gtctttcaac tcattgcatc 1643
atttacaaga caacattgta agagagatga gcacttctaa gttgagtata ttataataga 1703
ttagtactgg attattcagg ctttaggcat atgcttcttt aaaaacgcta taaattatat 1763
tcctcttgca tttcacttga gtggaggttt atagttaatc tataactaca tattgaatag 1823
ggctaggaat atagattaaa tcatactcct atgctttagc ttatttttac agttatagaa 1883
agcaagatgt actataacat agaattgcaa tctataatat ttgtgtgttc actaaactct 1943
gaataagcac tttttaaaaa actttctact cattttaatg attgtttaaa ggtttctatt 2003
ttctctgata cttttttgaa atcagtaaac actgtgtatt gttgtaaaat gtaaaggtca 2063
cttttcacat ccttgacttt ttagatgtgc tgctttgata tataggacat tgatttgatt 2123
tttattatta atgctttggt tctgggttgt ttcctaaaat atctgggtgg cttaaaaaaa 2183
actctttaac ttgtaataaa cccttaactg gcataggaaa tggtatccag aatggaattt 2243
tgctacatgg ggtctgggtg ggggcaaaga gacccagtca attacatgtt tggtaccaag 2303
aaaggaacct gtcagggcag tacaatgtga ctttgaaaat atataccgtg ggggtagttt 2363
taccctatat ctataaacac tgtttgttcc agaatctgta tgattctatg gagctatttt 2423
aaaccaattg caggtctaga 2443

<210> 40
<211> 363
<212> PRT
<213> Homo sapien

<400> 40
Met Lys Gly Asn Ser Thr Leu Ala Thr Thr Ser Lys Asn Ile Thr Ser
1 5 10 15

Gly Leu His Phe Gly Leu Val Asn Ile Ser Gly Asn Asn Glu Ser Thr
20 25 30

Leu Asn Cys Ser Gln Lys Pro Ser Asp Lys His Leu Asp Ala Ile Pro
35 40 45

Ile Leu Tyr Tyr Ile Ile Phe Val Ile Gly Phe Leu Val Asn Ile Val
50 55 60

Val Val Thr Leu Phe Cys Cys Gln Lys Gly Pro Lys Lys Val Ser Ser
65 70 75 80

Ile Tyr Ile Phe Asn Leu Ala Val Ala Asp Leu Leu Leu Leu Ala Thr
85 90 95

Leu Pro Leu Trp Ala Thr Tyr Tyr Ser Tyr Arg Tyr Asp Trp Leu Phe
100 105 110

Gly Pro Val Met Cys Lys Val Phe Gly Ser Phe Leu Thr Leu Asn Met
115 120 125

Phe Ala Ser Ile Phe Phe Ile Thr Cys Met Ser Val Asp Arg Tyr Gln
130 135 140

Ser Val Ile Tyr Pro Phe Leu Ser Gln Arg Arg Asn Pro Trp Gln Ala
145 150 155 160

Ser Tyr Ile Val Pro Leu Val Trp Cys Met Ala Cys Leu Ser Ser Leu
165 170 175

Pro Thr Phe Tyr Phe Arg Asp Val Arg Thr Ile Glu Tyr Leu Gly Val
180 185 190

Asn Ala Cys Ile Met Ala Phe Pro Pro Glu Lys Tyr Ala Gln Trp Ser
195 200 205

Ala Gly Ile Ala Leu Met Lys Asn Ile Leu Gly Phe Ile Ile Pro Leu
210 215 220

Ile Phe Ile Ala Thr Cys Tyr Phe Gly Ile Arg Lys His Leu Leu Lys
225 230 235 240

Thr Asn Ser Tyr Gly Lys Asn Arg Ile Thr Arg Asp Gln Val Leu Lys
245 250 255

Met Ala Ala Ala Val Val Leu Ala Phe Ile Ile Trp Cys Leu Pro Phe
260 265 270

His Val Leu Thr Phe Leu Asp Ala Leu Ala Trp Met Gly Val Ile Asn
275 280 285

Ser Cys Glu Val Ile Ala Val Ile Asp Leu Ala Leu Pro Phe Ala Ile
290 295 300

Leu Leu Gly Phe Thr Asn Ser Cys Val Asn Pro Phe Leu Tyr Cys Phe
305 310 315 320

Val Gly Asn Arg Phe Gln Gln Lys Leu Arg Ser Val Phe Arg Val Pro
325 330 335

Ile Thr Trp Leu Gln Gly Lys Arg Glu Ser Met Ser Cys Arg Lys Ser
340 345 350

Ser Ser Leu Arg Glu Met Glu Thr Phe Val Ser
355 360

<210> 41
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> Primer

<400> 41
actgcctgat aaccatgctg 20

<210> 42
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> Primer

<400> 42
atacttacac accaggaggg 20

<210> 43
<211> 19
<212> DNA
<213> Artificial Sequence

<220>
<223> Primer

<400> 43
atgcctgctc caaaggcac 19

<210> 44
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> Primer

<400> 44
atgcctgctc caaaggcacc


<210> 45 
<211> 21
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 45

atgcctgctc caaaggcaca t 

<210> 46 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 46 

tacttctggt tctctgagcg 

<210> 47 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 47

actcaccttg aactcgtctc 

<210> 48 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 48 

tggttctctg agcgagtctt 

<210> 49 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 49 

tggttctctg agcgagtctt c 

<210> 50 

<211> 22 

<212> DNA 

<213> Artificial Sequence 


WO 02/072604 
<220>

<223> Primer 
<400> 50 

tggttctctg agcgagtctt tc 

<210> 51 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 51 

tgcagatgga ctttggcttc 

<210> 52 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 52 

tgcttgcctt ctgctacaag 

<210> 53 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 53 

cttccctgag cacctgctg 

<210> 54 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 54 

cttccctgag cacctgctgg t 

<210> 55 

<211> 20 

<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Primer 
<400> 55

cttccctgag cacctgctga 20 

<210> 56 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 56

aacagctcag gacgaaactg 20

<210> 57 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 57 

agaaggagtt gaccttgtcc 

<210> 58 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 58

ggaagctcaa gtggccttc 19

<210> 59 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 59

ggaagctcaa gtggccttcc 

<210> 60 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 


<400> 60 

ggaagctcaa gtggccttca ac 22

<210> 61
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 61 

aagtcactgg cagagctgg 

<210> 62 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 62 

gcaccagggc tttgttgaag 

<210> 63 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 63 

ttttccccgt agggctcca 

<210> 64 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 64 

ttttccccgt agggctccac 

<210> 65 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 65 

ttttccccgt agggctccag c 

<210> 66 
<211> 20 
<212> DNA 
<213> Artificial Sequence
<220> 

<223> Primer 
<400> 66 

tgcagaagtc actggcagag 

<210> 67 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 67 

gttgaagttt tccccgtagg 

<210> 68 
<211> 19 
<212> DNA 

<213> Artificial Sequence
<400> 68 

actcctccac ctgctggtc 

<210> 69 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 69 

actcctccac ctgctggtcc 

<210> 70 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 70 

actcctccac ctgctggtct a 

<210> 71 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 


<400> 71 
aggacgtgcg tggcaacctg 20

<210> 72
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 72

agctctgcca gtgacttctg 20

<210> 73
<211> 19
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 73

gtgacttctg cagcccctc 19

<210> 74
<211> 20
<212> DNA

<213> Artificial Sequence
<400> 74

gtgacttctg cagcccctca 20

<210> 75
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 75

gtgacttctg cagcccctcg gt 22

<210> 76
<211> 19
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 76

cctgaccttc cagatgaag

<210> 77
<211> 19
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer 
<400> 77 

tcaggttgcc acgcacgtc 

<210> 78 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 78 

caggatctcg gccagtgc 

<210> 79 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 79 

caggatctcg gccagtgcc 

<210> 80 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 80 

caggatctcg gccagtgctg 

<210> 81 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 81

acctgcgaga gcttcagcag 

<210> 82 
<211> 20 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Primer 
<400> 82

tctccatgcg ctgtgcgtag 

<210> 83 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 83 

agctgcgcac ccaggtca 

<210> 84 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 84 

agctgcgcac ccaggtcaa 

<210> 85 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 85 

agctgcgcac ccaggtcagc 

<210> 86 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 86 

tgtccaagga gctgcaggc 

<210> 87 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 


<400> 87 

cttacgcagc ttgcgcaggt 20

<210> 88
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 88 

gcggacatgg aggacgtg 

<210> 89 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 89 

gcggacatgg aggacgtgc 

<210> 90 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 90 

gcggacatgg aggacgtgtg 

<210> 91 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 91 

gttgtagaaa gaaccgctgc 

<210> 92 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 92 

gagaacgagt cttcaggtac 

<210> 93 
<211> 21 
<212> DNA 
<213> Artificial Sequence
<220> 

<223> Primer 


<400> 93 

acaatctggg ctatgagatc a 21 

<210> 94 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 94 

acaatctggg ctatgagatc aa 22 

<210> 95 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 95 

acaatctggg ctatgagatc agt 23 

<210> 96 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 96 

cactctacac tgcatgtctc 20

<210> 97 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 97 

acccttctga aaaggagagg 20 

<210> 98 
<211> 20 
<212> DNA 

<213> Artificial Sequence 


<220> 


<223> Primer


<400> 98 

gaggagagac aaggcagata 20


<210> 99 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 99 

gaggagagac aaggcagata t 21 

<210> 100 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 100 

gaggagagac aaggcagata gt 22 

<210> 101 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 101 

aaaggttcag ttgctgctgc 20 

<210> 102 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 102 

gctggggaag gtctaataac 20 

<210> 103 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 


<400> 103 
gttgctgctg cctcgaatc 19


<210> 104 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 104 

gttgctgctg cctcgaatcc 20 

<210> 105 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 105 

gttgctgctg cctcgaatct g 21 

<210> 106 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 106 

cgtctttctc cagatgatgc 20 

<210> 107 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 107 

agtgtcctat gggctgtttg 20 

<210> 108 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 108 

ggatgccatt cataccttta c 21 


<210> 109 


<211> 22
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 109

ggatgccatt cataccttta cc 22 

<210> 110 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 110 

ggatgccatt cataccttta cgc 23 

<210> 111 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 111 

tgggaaaaca gtgcagtgtg 20 

<210> 112 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 112 

tgatcgtctt cagaacgagg 20 

<210> 113 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 113 

ccagaccatc atcatcccat gga 23

<210> 114 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Primer 


<400> 114 

ccagaccatc atcccatgga a 21


<210> 115 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 115 

ccagaccatc atcccatgga gc 22 

<210> 116 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 116 

cagcaatcgt ctttctccag 20 

<210> 117 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 117 

tcctatgggc tgtttgatgc 20 

<210> 118 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 118 

gtctttctcc agatgatgcc a 21 

<210> 119 
<211> 22 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Primer 
<400> 119

gtctttctcc agatgatgcc aa 22


<210> 120 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 120 

gtctttctcc agatgatgcc agt 23 

<210> 121 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 121 

agcggataac aatttcacac agg 23 

<210> 122 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 122 

ggcgcacgcc tccacg 16 